| Literature DB >> 29593668 |
Syed A Muhammad1,2,3, Jinlei Guo1,2, Thanh M Nguyen1,2,4, Xiaogang Wu5, Baogang Bai1, X Frank Yang6, Jake Y Chen7.
Abstract
Shiga toxin (Stxs) is a family of structurally and functionally related bacterial cytotoxins produced by Shigella dysenteriae serotype 1 and shigatoxigenic group of Escherichia coli that cause shigellosis and hemorrhagic colitis, respectively. Until recently, it has been thought that Stxs only inhibits the protein synthesis and induces expression to a limited number of genes in host cells, but recent data showed that Stxs can trigger several signaling pathways in mammalian cells and activate cell cycle and apoptosis. To explore the changes in gene expression induced by Stxs that have been shown in other systems to correlate with cancer progression, we performed the simulated analysis of cDNA dataset and found differentially expressed genes (DEGs) of human THP1-monocytic cells treated with Stxs. In this study, the entire data (treated and untreated replicates) was analyzed by statistical algorithms implemented in Bioconductor packages. The output data was validated by the k-fold cross technique using generalized linear Gaussian models. A total of 50 DEGs were identified. 7 genes including TSLP, IL6, GBP1, CD274, TNFSF13B, OASL, and PNPLA3 were considerably (<0.00005) related to cancer proliferation. The functional enrichment analysis showed 6 down-regulated and 1 up-regulated genes. Among these DEGs, IL6 was associated with several cancers, especially with leukemia, lymphoma, lungs, liver and breast cancers. The predicted regulatory motifs of these genes include conserved RELA, STATI, IRFI, NF-kappaB, PEND, HLF, REL, CEBPA, DI_2, and NFKB1 transcription factor binding sites (TFBS) involved in the complex biological functions. Thus, our findings suggest that Stxs has the potential as a valuable tool for better understanding of treatment strategies for several cancers.Entities:
Keywords: Shiga toxin; TFBS; cDNA dataset; cancers; differential expression; enrichment analysis
Year: 2018 PMID: 29593668 PMCID: PMC5859033 DOI: 10.3389/fmicb.2018.00380
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Figure 1Normalization and analysis of array quality metrics (A) Box-plot representing summaries of the signal intensity distributions of the arrays. Each box corresponds to one array. Outlier detection was performed by computing the Kolmogorov-Smirnov statistic Ka between each array's distribution and the distribution of the pooled data (B) Heatmap of the distances between arrays. The color scale is chosen to cover the range of distances encountered in the dataset (C) Histogram representing expression after normalization (D) MA plots. M and A are defined as: M = log2(I1) – log2(I2), A = 1/2 (log2(I1)+log2(I2)), where I1 is the intensity of the array studied, and I2 is the intensity of a “pseudo”-array that consists of the median across arrays.
Figure 2Side-by-side plot produced by plotAffyRNAdeg representing 5′-3′ trend to assess the severity of degradation of RNA and significance level.
A summary statistic for each array in the batch, assessing the severity of RNA degradation and significance level.
| 1 | GSM479983.CEL | 1.01000 | 0.00109 |
| 2 | GSM479984.CEL | 1.000000 | 0.000827 |
| 3 | GSM479985.CEL | 1.41e+00 | 1.15e-05 |
| 4 | GSM479986.CEL | 0.6570 | 0.0183 |
| 5 | GSM479987.CEL | 1.18000 | 0.00033 |
| 6 | GSM479988.CEL | 1.10e+00 | 9.65e-05 |
| 7 | GSM479989.CEL | 0.91700 | 0.00196 |
| 8 | GSM479990.CEL | 0.97500 | 0.00112 |
| 9 | GSM479991.CEL | 1.40e+00 | 1.32e-05 |
List of 50-differentially expressed genes.
| 1 | 240287_at | irg1 | −5.93568 | −50.392 | 7.88E-12 | 2.30E-07 | 14.35412 | Down-regulated |
| 2 | 235737_at | TSLP | −5.81414 | −50.0105 | 8.41E-12 | 2.30E-07 | 14.33257 | |
| 3 | 235574_at | GBP4 | −4.36184 | −40.5895 | 4.93E-11 | 8.98E-07 | 13.64744 | |
| 4 | 205207_at | IL6 | −5.18603 | −37.197 | 1.03E-10 | 1.41E-06 | 13.30453 | |
| 5 | 202510_s_at | TNFAIP2 | −4.1804 | −34.7333 | 1.84E-10 | 2.01E-06 | 13.01096 | |
| 6 | 202269_x_at | GBP1 | −3.20432 | −33.4148 | 2.55E-10 | 2.32E-06 | 12.83562 | |
| 7 | 227458_at | CD274 | −5.57009 | −31.9417 | 3.73E-10 | 2.91E-06 | 12.62265 | |
| 8 | 223501_at | TNFSF13B | −4.88682 | −31.3969 | 4.31E-10 | 2.95E-06 | 12.53892 | |
| 9 | 210797_s_at | OASL | −4.56379 | −29.9747 | 6.38E-10 | 3.87E-06 | 12.3066 | |
| 10 | 214038_at | ccl8 | −6.57392 | −28.3084 | 1.03E-09 | 5.65E-06 | 12.00654 | |
| 11 | 205013_s_at | Adora2a | −3.81754 | −27.4662 | 1.33E-09 | 6.62E-06 | 11.84223 | |
| 12 | 205569_at | LAMP3 | −6.29062 | −26.5359 | 1.78E-09 | 8.03E-06 | 11.64986 | |
| 13 | 229450_at | Ifit3 | −4.28387 | −26.3161 | 1.91E-09 | 8.03E-06 | 11.60267 | |
| 14 | 222793_at | DDX58 | −4.25905 | −25.5566 | 2.44E-09 | 8.71E-06 | 11.43411 | |
| 15 | 218810_at | ZC3H12A | −2.84809 | −25.299 | 2.66E-09 | 8.71E-06 | 11.37495 | |
| 16 | 205660_at | OASL | −4.97761 | −25.0953 | 2.84E-09 | 8.71E-06 | 11.32745 | |
| 17 | 202688_at | TNFSF10 | −5.15206 | −25.0718 | 2.87E-09 | 8.71E-06 | 11.32192 | |
| 18 | 205599_at | Traf1 | −4.14304 | −24.8005 | 3.14E-09 | 8.87E-06 | 11.2575 | |
| 19 | 204747_at | Ifit3 | −4.25172 | −24.7046 | 3.25E-09 | 8.87E-06 | 11.23446 | |
| 20 | 219716_at | APOL6 | −2.70983 | −24.2599 | 3.78E-09 | 9.53E-06 | 11.12554 | |
| 21 | 202759_s_at | akap2/PALM2 | −2.47769 | −24.2173 | 3.84E-09 | 9.53E-06 | 11.11493 | |
| 22 | 230036_at | SAMD9L | −2.94113 | −22.691 | 6.62E-09 | 1.57E-05 | 10.71328 | |
| 23 | 235643_at | SAMD9L | −3.19263 | −22.0591 | 8.39E-09 | 1.84E-05 | 10.53392 | |
| 24 | 223502_s_at | TNFSF13B | −5.13985 | −21.8857 | 8.96E-09 | 1.84E-05 | 10.48325 | |
| 25 | 205992_s_at | IL15 | −4.10568 | −21.8596 | 9.05E-09 | 1.84E-05 | 10.47556 | |
| 26 | 209038_s_at | EHD1 | −3.57268 | −21.8496 | 9.09E-09 | 1.84E-05 | 10.47261 | |
| 27 | 209037_s_at | EHD1 | −4.10382 | −21.4847 | 1.05E-08 | 2.04E-05 | 10.36366 | |
| 28 | 242752_at | – | −2.33834 | −21.2349 | 1.15E-08 | 2.08E-05 | 10.28736 | |
| 29 | 226757_at | Ifit2 | −5.22852 | −21.224 | 1.16E-08 | 2.08E-05 | 10.28402 | |
| 30 | 231577_s_at | GBP1 | −2.98548 | −21.1802 | 1.18E-08 | 2.08E-05 | 10.27046 | |
| 31 | 206157_at | Ptx3 | −3.72465 | −20.8941 | 1.32E-08 | 2.19E-05 | 10.18102 | |
| 32 | 208012_x_at | SP110 | −2.34041 | −20.8471 | 1.35E-08 | 2.19E-05 | 10.16617 | |
| 33 | 202760_s_at | akap2/PALM2 | −3.0481 | −20.8187 | 1.36E-08 | 2.19E-05 | 10.15714 | |
| 34 | 218943_s_at | DDX58 | −4.97703 | −20.6726 | 1.44E-08 | 2.25E-05 | 10.11048 | |
| 35 | 202086_at | mx1 | −3.63011 | −19.9264 | 1.96E-08 | 2.98E-05 | 9.864092 | |
| 36 | 225344_at | NCOA7 | −2.6496 | −19.8461 | 2.03E-08 | 3.00E-05 | 9.836755 | |
| 37 | 215495_s_at | samd4a | −1.78695 | −19.4961 | 2.35E-08 | 3.39E-05 | 9.715643 | |
| 38 | 220104_at | ZC3HAV1 | −3.4764 | −19.3094 | 2.55E-08 | 3.57E-05 | 9.649722 | |
| 39 | 210029_at | IDO1 | −6.46305 | −19.2265 | 2.64E-08 | 3.61E-05 | 9.620158 | |
| 40 | 211122_s_at | CXCL11 | −7.63526 | −19.0851 | 2.81E-08 | 3.75E-05 | 9.569288 | |
| 41 | 1557905_s_at | CD44 | −2.72423 | −18.8999 | 3.05E-08 | 3.88E-05 | 9.501801 | |
| 42 | 1555464_at | IFIH1 | −4.11661 | −18.8975 | 3.05E-08 | 3.88E-05 | 9.500928 | |
| 43 | 229221_at | CD44 | −3.80452 | −18.6478 | 3.41E-08 | 4.16E-05 | 9.408394 | |
| 44 | 210163_at | CXCL11 | −6.82817 | −18.6392 | 3.42E-08 | 4.16E-05 | 9.405188 | |
| 45 | 209723_at | SERPINB9 | −4.33364 | −18.5564 | 3.55E-08 | 4.22E-05 | 9.37409 | |
| 46 | 204103_at | CCL4 | −4.92524 | −18.3405 | 3.91E-08 | 4.55E-05 | 9.292083 | |
| 47 | 214329_x_at | TNFSF10 | −5.31551 | −18.2625 | 4.05E-08 | 4.57E-05 | 9.262101 | |
| 48 | 203915_at | Cxcl9 | −4.43103 | −18.2429 | 4.09E-08 | 4.57E-05 | 9.254554 | |
| 49 | 210285_x_at | WTAP | −2.59583 | −18.1602 | 4.25E-08 | 4.65E-05 | 9.22254 | |
| 50 | 220675_s_at | pnpla3 | 2.423905 | 25.63874 | 2.38E-09 | 8.71E-06 | 11.45275 | Up-regulated |
k-fold cross validation by bioconductor “boot” package using dispersion parameter of Gaussian family.
| (Intercept) | 0.023823 | 0.003194 | 7.46 | 8.81E−14 |
| x1 | 0.349441 | 0.00554 | 63.071 | <2.00E−16 |
| x2 | 0.078029 | 0.00486 | 16.056 | <2.00E−16 |
| x3 | 0.150248 | 0.002554 | 58.83 | <2.00E−16 |
| x4 | −0.09253 | 0.00321 | −28.827 | <2.00E−16 |
| x5 | −0.00143 | 0.002231 | −0.639 | 0.523 |
| x6 | 0.773287 | 0.002425 | 318.819 | <2.00E−16 |
| x7 | −0.22529 | 0.005723 | −39.362 | <2.00E−16 |
| x8 | −0.03568 | 0.00451 | −7.91 | <2.62E−15 |
Deviance Residuals: Min (−3.1894), 1Q (−0.0996), Median (0.0004), 3Q (0.1016), Max (1.2137).
Signif. codes: 0
0.001
0.01
0.05 “.” 0.1 “ ” 1.
Number of Fisher Scoring iterations: 2.
Null deviance: 195495.4 on 54674 degrees of freedom.
Residual deviance: 1786.6 on 54666 degrees of freedom.
The differentially expressed cancer-related genes curated from OMIM and Cancer Genetics. Web databases and their role has been referenced from PubMed.
| 1 | 235737_at | TSLP | 68 | Thymic stromal lymphopoietin | ||
| 2 | 205207_at | IL6 | 1067 | Interleukin-6 | ||
| 3 | 202269_x_at | GBP1 | 26 | Interferon-induced guanylate-binding protein | ||
| 4 | 227458_at | CD274 | 430 | Programmed cell death-1 | ||
| 5 | 223501_at | TNFSF13B | 68 | Tumor necrosis factor ligand superfamily member 13B | ||
| 6 | 210797_s_at | OASL | 12 | 2′-5′-oligoadenylate synthase | ||
| 7 | 220675_s_at | pnpla3 | 27 | Patatin-like phospholipase domain-c |
Figure 3Cluster analysis of 7 cancer-related DEGs with Euclidean distance (Binning method: Quantile). Green corresponds to a small distance and Red to a large distance. Lines indicate the boundaries of the clusters in the level of the tree. Dotted lines indicate down or up-regulated genes.
Functional and GO analysis of 7-differentially expressed genes.
| 1 | [GO:0006955] immune response | 6 | 1.68E-05 | 13.1 |
| 2 | [GO:0042129] regulation of T cell proliferation | 3 | 5.69E-04 | 72.7 |
| 3 | [GO:0050670] regulation of lymphocyte proliferation | 3 | 0.001017 | 54.3 |
| 4 | [GO:0070663] regulation of leukocyte proliferation | 3 | 0.001041 | 53.7 |
| 5 | [GO:0032944] regulation of mononuclear cell proliferation | 3 | 0.001041 | 53.7 |
| 6 | [GO:0050863] regulation of T cell activation | 3 | 0.002007 | 38.5 |
| 7 | [GO:0051249] regulation of lymphocyte activation | 3 | 0.003188 | 30.5 |
| 8 | [GO:0002694] regulation of leukocyte activation | 3 | 0.003992 | 27.2 |
| 9 | [GO:0050865] regulation of cell activation | 3 | 0.004426 | 25.8 |
| 10 | [GO:0005125] cytokine activity | 3 | 0.004485 | 25.0 |
| 11 | [GO:0050871] positive regulation of B cell activation | 2 | 0.019354 | 91.1 |
| 12 | [GO:0001776] leukocyte homeostasis | 2 | 0.021097 | 83.5 |
| 13 | [GO:0042102] positive regulation of T cell proliferation | 2 | 0.022838 | 77.1 |
| 14 | [GO:0050864] regulation of B cell activation | 2 | 0.029772 | 58.9 |
| 15 | [GO:0050671] positive regulation of lymphocyte proliferation | 2 | 0.032074 | 54.7 |
| 16 | [GO:0032946] positive regulation of mononuclear cell proliferation | 2 | 0.032649 | 53.7 |
| 17 | [GO:0070665] positive regulation of leukocyte proliferation | 2 | 0.032649 | 53.7 |
| 18 | [GO:0050870] positive regulation of T cell activation | 2 | 0.044081 | 39.6 |
| 19 | [GO:0002237] response to molecule of bacterial origin | 2 | 0.049753 | 35.0 |
| 20 | [GO:0051251] positive regulation of lymphocyte activation | 2 | 0.055958 | 31.0 |
| 21 | [GO:0048872] homeostasis of number of cells | 2 | 0.057644 | 30.1 |
| 22 | [GO:0002696] positive regulation of leukocyte activation | 2 | 0.061008 | 28.4 |
| 23 | [GO:0050867] positive regulation of cell activation | 2 | 0.063803 | 27.1 |
| 24 | [GO:0042127] regulation of cell proliferation | 3 | 0.074923 | 5.7 |
| 25 | [GO:0002252] immune effector process | 2 | 0.076569 | 22.4 |
GO, Gene Ontology.
Figure 4Functional enrichment analysis of expressively cancer-related DEGs using FunRich annotation tool (A) Biological pathways analysis (B) clinical phenotypes for GEGs.
Figure 5Protein-Protein Interaction network of cancer-related differential expressed genes. Each node represents the protein, while lines indicate the interaction edges. The association of these source proteins and their interactors (target proteins) in cancers are shown in a bar graph. Blue and red color indicate down and up-regulated genes, respectively.
Figure 6The role of 7 DEGs in various cancers with Absolute Pearson correlation. Red color indicates the maximum involvement of a gene in various cancers and Blue color represents the least association. This association was curated using OMIM, Cancer Genomic Web, and PubMed databases.
Figure 7Over-representation of cancer-related DEGs using oPOSSUM with 80% Matrix score.