| Literature DB >> 23825635 |
Shengjun Fan1, Zhenyu Pan, Qiang Geng, Xin Li, Yefan Wang, Yu An, Yan Xu, Lu Tie, Yan Pan, Xuejun Li.
Abstract
BACKGROUND: Ulcerative colitis (UC) was the most frequently diagnosed inflammatory bowel disease (IBD) and closely linked to colorectal carcinogenesis. By far, the underlying mechanisms associated with the disease are still unclear. With the increasing accumulation of microarray gene expression profiles, it is profitable to gain a systematic perspective based on gene regulatory networks to better elucidate the roles of genes associated with disorders. However, a major challenge for microarray data analysis is the integration of multiple-studies generated by different groups. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2013 PMID: 23825635 PMCID: PMC3692446 DOI: 10.1371/journal.pone.0067142
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Microarray data sets utilized in this experiment.
| Authors | Documents (PMID) | GEO number | Array type | Number of samples |
|
| 20087348 | GSE4183 | Affymetrix Human Genome U133 Plus 2.0 Array | 30(IBD:15; CRC:15) |
|
| 23388545 | GSE37283 | Affymetrix HT HG-U133+ PM Array | 15(UC:4;CRC:11) |
Abbreviations: IBD: inflammatory bowel disease, CRC: colorectal carcinoma, UC: ulcerative colitis.
Figure 1Robust multi-array average (RMA) analysis results of microarray data based on WebArray.
(A) Statistical analysis result plot for GSE4183 included M-A plot, M-B plot, M histogram and B statistics histogram. (B) Statistical analysis result plot for GSE37283 included M-A plot, M-B plot, M histogram and B statistics histogram. M: the log-differential expression ratio; A: the log-intensity of spot, a measure of overall brightness of spot; B: B statistics, the log-odds of differential expression.
Figure 2Cross-study normalization and integration results of two separate microarrays based upon ArrayMining.
Figure 3Signaling regulatory modules of human cancer signaling network generated by AllegroMCODE based on molecular complex detection (MCODE) algorithm.
A-H represented signaling regulatory networks 1–8. Circle dots in the networks corresponded to genes. Red represented high degree connectivity, whereas green stood for low degree connectivity.
Simulated results for the random microarray data sets (RDSs).
| Rank of RDSs | Virtual microarray data sets | Drivergenes | Signalingnetworks |
| 1 | IBD(GSM95520,95513,95522,95518,95521,95517); CRC(GSM915466,915461,95499,95510,915465,95501) | CDK6,CEBPD,SOCS1,CD19,BCL6 | 3,6,7 |
| 2 | IBD(GSM915451,915452,95512,95523,95515,95524,95514);CRC(GSM91546,95502,915463,915470,95506,95498) | PTGS2 | 6 |
| 3 | IBD(GSM95515,95511,95517,95521,95513,95518); CRC(GSM95496,95497,915470,95508,95500,95509) | NEDD4L | 3 |
| 4 | IBD(GSM95520,95523,915452,95514,915454,95525); CRC(GSM915469,915465,95505,95506,95502,95510) | SUMO1,SYT1,SOX2,PTGS2 | 1,3,6,7 |
| 5 | IBD(GSM95512,95520,915453,95519,95511,95524); CRC(GSM95509,95507,95510,915464,915468,915467) | STK4,MORF4L2 | 6,7 |
| 6 | IBD(GSM95518,915452,95516,95525,95522,91515); CRC(GSM915469,95500,95496,915461,95499,95497) | STAT1 | 6 |
| 7 | IBD(GSM915451,95523,95522,95515,915453,95521); CRC(GSM915469,915464,95504,95503,915467,915460) | PTGS2 | 6 |
| 8 | IBD(GSM915451, 915452, 915453, 915454); CRC(GSM915460, 915461, 915462, 915463, 915464, 915465, 915466, 915467, 915468, 915469,915470) | CXCR4,SMAD3,PTGS2,PTPRC,BCL6,TLR4 | 3,6,7 |
Abbreviations: IBD: inflammatory bowel disease, CRC: colorectal carcinoma.
Figure 4Layered signaling regulatory networks driven by co-differently-expressed microarray genes involved in malignant transition of colorectal cancer from the benign chronic non-solving ulcerative colitis to the more aggressive one.
In the layered signaling regulatory networks, the size of each node was proportional to the degree. In addition, red, blue and green represented nodes stemmed from signaling networks 3, 6 and 7, respectively.
Figure 5Heatmap of the over-represented biological processes and enrichment pathways using stats package in R environment.
For each figure, columns correspond to biological processes (A) or signaling pathways (B), and rows correspond to gene cluster category and subcellular localization. Expression values are logarithm of ratio value utilizing log transform data. Red and blue in each grid represented positive, while white represented null. (A) Significantly over-represented GO biological processes in differential cluster and layers. (B) Significantly enriched signaling pathways in differential cluster and layers. E: Extracellular. M: Membrane. C: Cytoplasm. N: Nucleus.