| Literature DB >> 20370926 |
Ken Tatebe1, Ahmet Zeytun, Ruy M Ribeiro, Robert Hoffmann, Kevin S Harrod, Christian V Forst.
Abstract
BACKGROUND: The recent emergence of the H5N1 influenza virus from avian reservoirs has raised concern about future influenza strains of high virulence emerging that could easily infect humans. We analyzed differential gene expression of lung epithelial cells to compare the response to H5N1 infection with a more benign infection with Respiratory Syncytial Virus (RSV). These gene expression data are then used as seeds to find important nodes by using a novel combination of the Gene Ontology database and the Human Network of gene interactions. Additional analysis of the data is conducted by training support vector machines (SVM) with the data and examining the orientations of the optimal hyperplanes generated.Entities:
Mesh:
Year: 2010 PMID: 20370926 PMCID: PMC2868837 DOI: 10.1186/1471-2105-11-170
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Significant GO nodes
| Significant GO Nodes at 24 Hours | ||
|---|---|---|
| AIM2 | absent in melanoma 2 | 2.71 |
| CCL5 | chemokine (C-C motif) ligand 5 | 1.64 |
| CHST4 | carbohydrate (N-acetylglucosamine 6-O) sulfotransferase 4 | 1.53 |
| EDG6 | endothelial differentiation, lysophosphatidic acid GPCR 6 | -1.22 |
| EXO1 | exonuclease 1 | 0.08 |
| GBP1 | guanylate binding protein 1, interferon-inducible, 67 kDa | 1.75 |
| GBP4 | guanylate binding protein 4 | 1.25 |
| OAS1 | 2',5'-oligoadenylate synthetase 1, 40/46 kDa | 1.91 |
| OAS2 | 2',5'-oligoadenylate synthetase 2, 69/71 kDa | 2.36 |
| OAS3 | 2',5'-oligoadenylate synthetase 3, 100 kDa | 2.68 |
| SPON2 | spondin 2, extracellular matrix protein | -1.29 |
| TAP1 | transporter 1, ATP-binding cassette, sub-family B (MDR/TAP) | 1.63 |
| CCNA2 | cyclin A2 | 1.58 |
| CCNE2 | cyclin E2 | 2.32 |
| CDC6 | CDC6 cell division cycle 6 homolog ( | 1.86 |
| E2F1 | E2F transcriptoin factor 1 | 1.37 |
| MCM6 | MCM6 minichromosome maintenance deficient 6 | 2.05 |
| PRC1 | protein regulator of cytokinesis 1 | 1.17 |
| SPC25 | kinetochore protein | 2.15 |
| TXNIP | thioredoxin interacting protein | 1.29 |
| UHRF1 | ubiquitin-like, containing PHD and RING finger domains, 1 | 1.63 |
| CCL5 | chemokine (C-C motif) ligand 5 | 1.64 |
| IFIH1 | interferon induced with helicase C domain 1 | 1.87 |
| IRF7 | interferon regulatory factor 7 | 1.76 |
| MX1 | myxovirus (influenza virus) resistance 1, interferon-inducible prot. p78 (mouse) | 2.57 |
| MX2 | myxovirus (influenza virus) resistance 2 (mouse) | 2.43 |
| OAS1 | 2',5'-oligoadenylate synthetase 1, 40/46 kDa | 1.91 |
| STAT1 | signal transducer and activator of transcription 1, 91 kDa | 1.66 |
| MCM6 | MCM6 minichromosome maintenance deficient 6 | 2.05 |
| MCM10 | MCM10 minichromosome maintenance deficient 10 ( | 2.10 |
The most significant GO nodes are given with their raw p-values. Under each GO entry is a list of the genes associated with that node.
Figure 1Network of significant genes and GO nodes at 24 hours. The Gene Ontology nodes found to be significant in the BiNGO analysis are shown as rectangles, with the more orange nodes being more statistically significant. The genes associated with the GO nodes are listed in ovals connected by black arrows to the GO nodes. These genes are further connected to other genes in the Human Network via yellow and orange edges. Red ovals indicate up-regulated genes while blue indicates down-regulated genes (see Additional files 1 &2).
Top 3 most strongly regulated genes at 8 and 24 hours
| Gene | 8 hrs | 24 hrs |
|---|---|---|
| EGR2 | -3.65 ± 0.10 | -3.47 ± 0.00 |
| FOS | -3.47 ± 0.00 | -3.25 ± 0.16 |
| EGR1 | -3.32 ± 0.00 | -3.10 ± 0.08 |
Errors are based on the variation of the fold change in expression between the three independent trials. EGR2 has the greatest average down-regulation of all genes observed in this study at both 8 and 24 hours with FOS being more down-regulated than EGR1 at 8 hours and FOS and EGR1 switching places at 24 hours. The log2 fold changes are shown.
Definition of genes referred to in the main text
| Gene | Definition |
|---|---|
| ABCC3 | ATP-binding cassette, sub-family C (CFTR/MRP), member 3 |
| CCL2 | chemokine (C-C motif) ligand 2 |
| CDC6 | cell division cycle 6 homolog ( |
| CTGF | connective tissue growth factor |
| CXCL11 | chemokine (C-X-C motif) ligand 11 |
| CYR61 | cysteine-rich, angiogenic inducer, 61 |
| DUSP1 | dual specificity phosphatase 1 |
| EGR1/2 | early growth response 1/2 |
| FOS | v-fos FBJ murine osteosarcoma viral oncogene homolog |
| FOSB | FBJ murien osteosarcoma viral oncogene homolog B |
| IFI27 | interferon, alpha-inducible protein 27 |
| IFIT5 | interferon-induced protein with tetratricopeptide repeats 5 |
| IL17C | interleukin 17C |
| ISG15 | ISG15 ubiquitin-like modifier |
| JUN | v-jun sarcoma virus 17 oncogene homolog ( |
| LGALS2 | lectin, galactoside-binding, soluble, 2 |
| MCM6 | minichromosome maintenance deficient 6 |
| PLK3 | polo-like kinase 3 ( |
| PTGS2 | prostaglandin-endoperoxide synthase 2 |
| SDPR | serum deprivation response (phosphatidylserine binding protein) |
| SOCS1/3 | suppressor of cytokine signaling 1/3 |
| STAT1 | signal transducer and activator of transcription 1, 91 kDa |
| STAT3 | signal transducer and activator of transcription 3 (acute-phase response factor) |
| TLR | toll-like receptor |
| TAP1 | transporter 1, ATP-binding cassette, sub-family B (MDR/TAP) |
| ZFP36 | zinc finger protein 36, C3H type, homolog ( |
Figure 2Network of genes using daughters of BiNGO nodes as seeds. Genes that are up-regulated are shown in red and those in blue are down-regulated. Any gene found to be a daughter of a GO node found to be significant under the BiNGO analysis was included in a seed set used to generate the resulting network connecting the BiNGO nodes. In this case, the seed genes are not necessarily significant, rather, it is their associated GO node that is significant (see Additional files 3 &4).
Genes common to significance and SVM analyses
| SVM Gene | Description (if available) | 8 hrs | 24 hrs | |
|---|---|---|---|---|
| CYR61 | -1.51 | cysteine-rich, angiogenic inducer, 61 | + | + |
| SOCS3 | -1.06 | suppressor of cytokine signaling 3 | - | + |
| AK024238 | -1.32 | mRNA, possibly cadherin 6, type 2 (CDH6) | + | + |
| AA768672 | -1.47 | mRNA, possibly LSM14A | + | + |
| THC2348879 | -1.22 | + | - | |
| THC2313287 | -2.74 | + | + | |
| IER2 | -1.64 | immediate early response 2 | + | + |
| ZFP36 | -1.74 | zinc finger protein 36 | + | + |
| FOS | -3.47 | v-fos FBJ murine osteosarcoma viral oncogene homolog | + | + |
| NFKBIE | 1.08 | nuclear factor of kapppa light polypeptide gene enhancer in B-cells inhibitor | + | + |
| AT_nC_3 | -1.32 | + | - | |
| CTGF | -1.43 | connective tissue growth factor | + | - |
| AF159295 | -1.89 | mRNA, MAP/microtubule affinity-regulating kinase 3 (MARK3) | + | + |
| JUN | -1.74 | v-jun sarcoma virus 17 oncogene homolog ( | + | + |
| FOSB | -2.18 | FBJ murien osteosarcoma viral oncogene homolog B | + | + |
| EGR1 | -3.18 | early growth response 1 | + | + |
| HP1BP3 | -1.00 | heterochromatin protein 1, binding protein 3 | + | - |
| DUSP1 | -1.79 | dual specificity phosphatase 1 | + | + |
| ABCC3 | 1.38 | ATP-binding cassette, sub-family C (CFTR/MRP), member 3 | + | + |
| FOXC1 | -1.15 | forkhead box C1 | - | + |
| SFRS5 | -1.47 | splicing factor, arginine/serine-rich 5 | + | + |
| DB363693 | -1.43 | + | - | |
| THC2343678 | -1.47 | + | + | |
| A_24_P161068 | -1.09 | - | + |
Genes are listed in order of importance in the SVM analysis. The fold change in each gene is the average fold change in gene expression including both the 8 and 24 hour data. In the last two columns, a "+" indicates that the gene was found in both the Significance and SVM Analyses at 8 or 24 hours, respectively. A "-" indicates that the gene was not found in the results in both analyses at 8 or 24 hours. Genes through sfrs5 are among the top 35 and through the end of the list are in the top 43.
Components of the Optimal Hyper Plane (OHP) in the SVM analysis
| SVM Components of Significant Genes | |||||
|---|---|---|---|---|---|
| CYR61 | -1.51 | 0.16 | 2.8 × 10-6 | 0.31 | -.0098 |
| SOCS3 | -1.06 | -0.07 | 0.0017 | 0.39 | -0.0095 |
| LOC196752 | -0.94 | 0.11 | 0.01 | 0.6 | -0.0094 |
| ALDH3B1 | -0.86 | 0.07 | 0.012 | 0.9 | -0.0094 |
| KLF6 | -1.60 | 0.26 | 5 × 10-7 | 0.38 | -0.0093 |
| AK024238 | -1.32 | 0.06 | 7.6 × 10-7 | 0.39 | -0.0093 |
| AA768672 | -1.47 | -0.30 | 6.9 × 10-7 | 0.3 | -0.0093 |
| TUBAL3 | -1.22 | -0.62 | 0 | 0.078 | -0.0092 |
| THC2348879 | -1.22 | -0.09 | 1.4 × 10-6 | 0.41 | -0.0088 |
| THC2313287 | -2.74 | -0.09 | 4.1 × 10-13 | 0.52 | -0.0088 |
| IER2 | -1.64 | 0.07 | 1.8 × 10-6 | 0.61 | -0.0088 |
| ZFP36 | -1.74 | 0.10 | 3.7 × 10-7 | 0.67 | -0.0086 |
| FOS | -3.47 | 0.10 | 1.4 × 10-19 | 0.6 | -0.0086 |
| AHNAK | -0.94 | 0.15 | 0.012 | 0 | -0.0085 |
| NFKBIE | 1.08 | -0.09 | 0 | 0.37 | 0.0085 |
| A_24_P7785 | -0.97 | -0.12 | 0.013 | 0.4 | -0.0085 |
| AT_nC_3 | -1.32 | -0.20 | 0.0067 | 0.78 | -0.0084 |
| CTGF | -1.43 | 0.03 | 2 × 10-8 | 0.37 | -0.0084 |
| AT_ssH_RR_5 | -1.56 | -0.14 | 1.3 × 10-6 | 0.26 | -0.0083 |
| AF159295 | -1.89 | 0.26 | 2.5 × 10-7 | 0.3 | -0.0083 |
| JUN | -1.74 | -0.20 | 7.3 × 10-7 | 0.26 | -0.0083 |
| FOSB | -2.18 | 0.26 | 1.2 × 10-9 | 0.58 | -0.0082 |
| SYT12 | 0.88 | -0.09 | 0.012 | 0.93 | 0.0082 |
| AK022038 | -0.89 | 0.26 | 0.013 | 0.35 | -0.0082 |
| EGR1 | -3.18 | 0.21 | 1.9 × 10-18 | 0.44 | 0.0081 |
| A_24_P494658 | -0.86 | 0.08 | 0.022 | 0.25 | -0.008 |
| HP1BP3 | -1.00 | -0.04 | 0.0033 | 0.62 | -0.008 |
| ENST00000380874 | -0.97 | -0.42 | 0.0033 | 0.21 | -0.0079 |
| TRAF2 | -0.77 | -0.17 | 0.025 | 0.31 | 0.0078 |
| DUSP1 | -1.79 | 0.43 | 2.2 × 10-7 | 0.27 | -0.0078 |
| RELB | 1.14 | 0.15 | 0 | 0.29 | 0.0078 |
| ABCC3 | 1.38 | 0.03 | 1.9 × 10-6 | 0.23 | 0.0078 |
| FOXC1 | -1.15 | 0.01 | 0 | 0.72 | -0.0077 |
| SFRS5 | -1.47 | 0.41 | 7.1 × 10-9 | 0.31 | -0.0077 |
| G0S2 | 2.38 | 0.42 | 1.5 × 10-8 | 0.22 | 0.0076 |
The components of the OHP in the SVM analysis are displayed along with the average fold change of the gene in the H5N1 data and control data, and the average p-values in the H5N1 and control data. The length of each component is also given. A negative component to the vector normal to the OHP indicates a decrease in the fold change in gene expression from the control to the H5N1 exposures while a positive component indicates an increase in the differential expression from control to H5N1.
Figure 3Pathway analysis of major gene products affected by virulent infection. A sub-network of the response network from Figure 2 reveals interactions between the major gene products affected by virulent infections. This network is in accordance with Figure 5 of Djavani et al. (2007) [16]. In contrast to Djavani et al., PTGS2 and IL1RL1 (not shown: as well as IL1R1 and IL1R2) are up-regulated. EGR1, EGR2, FOS, FOSB are transcription factors. PTGS2 encodes prostaglandin-endoperoxide synthase 2, IL1A/B code for interleukin 1 and IL1RL1 belongs to the interleukin 1 receptor family. Color coding is according to Figure 1. Black arrows indicate known connections not realized in this particular response network (see Additional files 5 &6).
Figure 4Differential expression network between H5N1 and RSV. Response network of differentially expressed genes between H5N1 and RSV. (a) A response network H5N1 versus RSV at 8 h is shown with 54 nodes and 65 interactions. The network was calculated with parameters k = 3 and l = 4.5. (b) A H5N1 versus RSV response network at 24 h is depicted with 53 nodes and 71 interactions. Parameters k = 3 and l = 5 were used for calculation. Color coding is according to Figure 1. Edges with arrows indicate chemical reactions, diamond-shaped edge tips denote activation, circle-shaped tips refer to phorphorylation reactions and non-decorated edges are non-directional interactions by physical binding or by inferrence from the iHOP database (see Additional files 7, 8, 9 and 10).
Most significant GO nodes
| Significant GO Nodes at 24 Hours | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2376 | 2.4009E-16 | 1.8007E-13 | 35 | 783 | immune system | IFIH1, IFITM1, IFITM3, G1P3, OAS3, PRKR | ||||||
| process | TLR2, ISG15, OAS1, C1S, OAS2, ISGF3G | |||||||||||
| CXCL11, CCL5, HSH2D, BF, PGLYRP4 TAP1 | ||||||||||||
| MX1, SPON2, MX2, EDG6, EXO1, EGR1, ZFP36 | ||||||||||||
| CHST4, HLA-DQA2, AIM2, PDCD1LG1, IKBKE, CD34 | ||||||||||||
| IRF7, IFIT5, DMBT1, GBP1 | ||||||||||||
| 6955 | 6.2296E-16 | 4.6722E-13 | 31 | 613 | immune | IFIH1, IFITM1, IFITM3, TLR2, G1P3, OAS3 | ||||||
| response | PRKR, ISG15, OAS1, C1S, OAS2, ISGF3G | |||||||||||
| CXCL11, CCL5, BF, PGLYRP4, TAP1, MX1 | ||||||||||||
| SPON2, MX2, EDG6, EXO1, CHST4, HLA-DQA2, AIM2 | ||||||||||||
| PDCD1LG1, IKBKE, IRF7, IFIT5, DMBT1, GBP1 | ||||||||||||
| 51869 | 4.4363E-13 | 3.3273E-10 | 54 | 2335 | response to | PIR51, PRKR, G1P3, TLR2, ISG15, ISGF3G | ||||||
| stimulus | CXCL11, CALB1, FOS, PGLYRP4, MX1, SPON2 | |||||||||||
| FANCA, CCNA2, MX2, IHPK3, CYR61, RAMP | ||||||||||||
| EDG6, ZFP36, ECGF1, SAA4, CHST4, FOSB | ||||||||||||
| HLA-DQA2, RAD51, PDCD1LG1, UHRF1, GADD45B | ||||||||||||
| PROS1, GBP1, IFIH1, IFITM1, IFITM3 | ||||||||||||
| OAS3, OAS1, CHEK1, OAS2, C1S, CCL5 | ||||||||||||
| BF, TYMS, TAP1, EXO1, GSTA3, SGK, STAT1 | ||||||||||||
| ABCG1, AIM2, IKBKE, DUSP1, IRF7, IFIT5, DMBT1 | ||||||||||||
| 7049 | 1.8839E-10 | 1.4129E-7 | 28 | 808 | cell cycle | E2F1, PRC1, IFITM1, HCAP-G, PRKR, CHEK1 | ||||||
| MCM10, CCNE2, SPC25, CDC45L, CCNA2 | ||||||||||||
| EXO1, CDC6, MKI67, NUSAP1, FOSB, STAT1 | ||||||||||||
| MCM6, RAD51, UHRF1, PLK3, DUSP1 KNTC2 | ||||||||||||
| JUN, TOPK, FOXC1, GADD45B, DMBT1 | ||||||||||||
| 22402 | 4.4621E-9 | 3.3466E-6 | 24 | 690 | cell cycle | EXO1, E2F1, CDC6, IFITM1, MKI67, PRC1 | ||||||
| process | HCAP-G, NUSAP1, CHEK1, FOSB, MCM10, STAT1 | |||||||||||
| RAD51, CCNE2, SPC25, PLK3, CDC45L, KNTC2 | ||||||||||||
| JUN, TOPK, FOXC1, GADD45B, CCNA2, DMBT1 | ||||||||||||
| 9615 | 5.9766E-9 | 4.4825E-6 | 10 | 91 | response | IFIH1, IRF7, PRKR, ISG15, OAS1, ISGF3G | ||||||
| to virus | CCL5, MX1, STAT1, MX2 | |||||||||||
| 6260 | 8.9240E-9 | 6.6930E-6 | 13 | 188 | DNA | EXO1, CDC6, ECGF1, MCM10, ORC1L, TK1 | ||||||
| replication | RAD51, MCM6, CCNE2, TYMS, CDC45L, PFS2, RAMP | |||||||||||
| 9607 | 2.1548E-8 | 1.6161E-5 | 14 | 241 | response to | IFIH1, IFITM1, IFITM3, PRKR, ISG15, OAS1 | ||||||
| biotic stimulus | ISGF3G, STAT1, CCL5, PGLYRP4, IRF7, MX1 | |||||||||||
| MX2, DMBT1 | ||||||||||||
| 51707 | 5.8349E-8 | 4.3762E-5 | 12 | 182 | response to | IFIH1, PGLYRP4, IRF7, PRKR, ISG15, OAS1 | ||||||
| other organism | ISGF3G, CCL5, MX1, STAT1, MX2, DMBT1 | |||||||||||
| 6950 | 5.9053E-7 | 4.4290E-4 | 24 | 894 | response to | EXO1, ZFP36, GSTA3, PIR51, SGK, TLR2 | ||||||
| stress | SAA4, CHST4, CHEK1, C1S, CCL5, CXCL11, RAD51 | |||||||||||
| BF, TYMS, FOS, UHRF1, DUSP1, GADD45B, PROS1 | ||||||||||||
| CCNA2, FANCA, IHPK3, RAMP | ||||||||||||
| 74 | 1.7011E-6 | 1.2758E-3 | 17 | 506 | reg. of prog. | E2F1, CDC6, MKI67, IFITM1, NUSAP1, CHEK1 | ||||||
| through cell cycle | FOSB, STAT1, MCM10, CCNE2, PLK3, CDC45L | |||||||||||
| JUN, FOXC1, GADD45B, CCNA2, DMBT1 | ||||||||||||
| 51726 | 1.8935E-6 | 1.4202E-3 | 17 | 510 | regulation of . | E2F1, CDC6, MKI67, IFITM1, NUSAP1, CHEK1 | ||||||
| cell cyc | FOSB, STAT1, MCM10, CCNE2, PLK3, CDC45L | |||||||||||
| JUN, FOXC1, GADD45B, CCNA2, DMBT1 | ||||||||||||
| 6270 | 3.1656E-6 | 2.3742E-3 | 5 | 27 | DNA replication initiation | CCNE2, CDC6, CDC45L, ORC1L, MCM6 | ||||||
| 51706 | 6.8716E-6 | 5.1537E-3 | 12 | 285 | multi-organism | IFIH1, PGLYRP4, IRF7, PRKR, ISG15, OAS1 | ||||||
| process | ISGF3G, CCL5, MX1, STAT1, MX2, DMBT1 | |||||||||||
| 6259 | 9.7883E-6 | 7.3412E-3 | 19 | 704 | DNA metabolic | EXO1, CDC6, PIR51, ECGF1, CHEK1, MCM10 | ||||||
| process | ORC1L, MCM6, TK1, RAD51, CCNE2, TYMS, FOS | |||||||||||
| UHRF1, CDC45L, PFS2, FANCA, IHPK3, RAMP | ||||||||||||
| 6263 | 1.9790E-5 | 1.4843E-2 | 7 | 96 | DNA-dependent DNA replic. | EXO1, CCNE2, CDC6, CDC45L, ORC1L, MCM6, RAD51 | ||||||
| 278 | 2.0740E-5 | 1.5555E-2 | 11 | 267 | mitotic cell cycle | E2F1, SPC25, CDC6, KNTC2, PRC1, HCAP-G | ||||||
| NUSAP1, TOPK, FOXC1, CHEK1, CCNA2 | ||||||||||||
| 9719 | 2.9713E-5 | 2.2285E-2 | 12 | 330 | response to | EXO1, TYMS, UHRF1, PIR51, SGK, CHEK1 | ||||||
| endogenous stim. | FANCA, CCNA2, IHPK3, ABCG1, RAD51, RAMP | |||||||||||
| 6952 | 4.3825E-5 | 3.2869E-2 | 15 | 520 | defense response | ZFP36, IFIH1, TLR2, SAA4, CHST4, C1S | ||||||
| CXCL11, CCL5, BF, FOS, PGLYRP4, TAP1, MX1 | ||||||||||||
| MX2, DMBT1 | ||||||||||||
| 6974 | 4.8613E-5 | 3.6459E-2 | 11 | 293 | response to DNA | EXO1, TYMS, UHRF1, PIR51, SGK, CHEK1 | ||||||
| damage stim. | FANCA, CCNA2, IHPK3, RAD51, RAMP | |||||||||||
| 279 | 5.9994E-5 | 4.4996E-2 | 10 | 248 | M phase | EXO1, SPC25, CDC6, KNTC2, HCAP-G, NUSAP1 | ||||||
| TOPK, CHEK1, CCNA2, RAD51 | ||||||||||||
| 22403 | 6.1999E-5 | 4.6500E-2 | 11 | 301 | cell cycle phase | EXO1, E2F1, SPC25, CDC6, KNTC2, HCAP-G | ||||||
| NUSAP1, TOPK, CHEK1, CCNA2, RAD51 | ||||||||||||
The most significant GO nodes are given with their raw p-values. Under each GO entry is a list of the genes associated with that node.
Genes found in the Human iHOP Network using genes significant at 24 hours as seeds
| Gene | p-value |
|---|---|
| CCL2 | 4.0 × 10-1 |
| CSF2 | 1.0 × 10-2 |
| CSF3 | 3.8 × 10-2 |
| CXCL10 | 1.4 × 10-1 |
| CXCR4 | 8.1 × 10-3 |
| IL1a | 3.2 × 10-1 |
| PTGS2 | 2.1 × 10-2 |
| SOD2 | 1.0 × 10-1 |
| TLR1 | < 1.0 × 10-4 |
| TNF | 1.7 × 10-2 |