| Literature DB >> 23185449 |
Sarah C Baetke1, Michiel E Adriaens, Renaud Seigneuric, Chris T Evelo, Lars M T Eijssen.
Abstract
BACKGROUND: Prostate cancer is currently the most frequently diagnosed malignancy in men and the second leading cause of cancer-related deaths in industrialized countries. Worldwide, an increase in prostate cancer incidence is expected due to an increased life-expectancy, aging of the population and improved diagnosis. Although the specific underlying mechanisms of prostate carcinogenesis remain unknown, prostate cancer is thought to result from a combination of genetic and environmental factors altering key cellular processes. To elucidate these complex interactions and to contribute to the understanding of prostate cancer progression and metastasis, analysis of large scale gene expression studies using bioinformatics approaches is used to decipher regulation of core processes. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2012 PMID: 23185449 PMCID: PMC3502280 DOI: 10.1371/journal.pone.0049831
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Standardized microarray data analysis workflow.
Starting from the publicly available EMBL repository ArrayExpress: 1) Relevant prostate cancer studies were selected and downloaded; 2) Quality control and data pre-processing steps were performed in the R environment. Microarrays with insufficient sample quality, hybridization quality, signal comparability or array correlation were excluded; 3) For each included study, statistical analysis was performed and pathway analysis was run with PathVisio to identify the biological processes involved; 4) Results were then integrated and compared to literature findings.
Characteristics of the selected datasets.
| Dataset | ArrayExpress ID | Array type | Number of samples |
|
| E-GEOD-3325 | Affymetrix Human Genome U133 Plus 2.0 | 19 (NP: 6, pPC: 7, mPC: 6) (−0) |
|
| E-GEOD-6956 | Affymetrix Human Genome U133A 2.0 | 46 (NP: 11 (−3), PC: 35 (−1) |
|
| E-GEOD-25136 | Affymetrix Human Genome U133A | 79 (indolent: 40 (−2), recurrent: 39 (−4)) |
|
| E-GEOD-2443 | Affymetrix Human Genome U133A | 20 (AI: 10 (−1), AD: 10 (−1)) |
|
| E-GEOD-20758 | Affymetrix Human Genome U133 2.0 | 6 (PC: 3, stromal cell samples: 3) (−6) |
The number of excluded arrays for each dataset using the standardized QC procedure is indicated in brackets. Abbreviations: NP: normal prostate, pPC: primary prostate cancer, mPC: metastatic prostate cancer, AI: androgen independent, AD: androgen dependent.
Figure 2Overview of the QC results of two selected datasets.
Several QC results of the dataset by Varambally et al. comparing arrays of benign prostate tissue (maroon), primary prostate cancer (blue) and metastatic prostate cancer samples (green) are shown in panel a-d. Several QC results of the dataset by Gregg et al. comparing arrays of prostatic epithelial (maroon) with interstitial stromal cell samples (teal) are depicted in panel e-h. a) Boxplot of raw intensities; b) density histogram of log intensities before normalization; c) density histogram of log intensities after normalization; d) MA-plot before (upper panel) and after normalization (lower panel) of one example array of the dataset by Varambally et al.; e) boxplot of raw intensities; f) density histogram of log intensities before normalization; g) density histogram of log intensities after normalization; h) MA-plot before (upper panel) and after normalization (lower panel) of one example array of the dataset by Gregg et al.
PathVisio results of significant pathways found in the dataset by Varambally et al. comparing processed data provided by ArrayExpress with the reprocessed data.
| Pathway | Z Score (ArrayExpress) | Z Score (Standardized processing) |
|
|
|
|
| Glutathione metabolism |
| 1.59 |
| Striated Muscle Contraction |
| 0.78 |
| Endochondral Ossification |
| 1.32 |
| Delta-Notch Signaling Pathway |
| 0.68 |
|
|
|
|
|
|
|
|
| Eicosanoid Synthesis | 1.53 |
|
| Prostaglandin Synthesis and Regulation | 1.34 |
|
| Id Signaling Pathway | 1.14 |
|
| Selenium | 0.98 |
|
| Nicotine Activity on Dopaminergic Neurons | 0.69 |
|
| Cytoplasmic Ribosomal Proteins | 0.10 |
|
| Irinotecan Pathway | 0.07 |
|
| Ganglio Sphingolipid Metabolism | −0.21 |
|
| Sulfation | −0.34 |
|
Pathway analysis is based on a comparison between benign prostate tissue and primary prostate cancer. Only significant pathways with a Z-score >1.9 in at least one of the two analyses are included. Significant Z-scores are depicted in bold; matches in pathways between the analyses are in italics.
PathVisio results of significant pathways found in the dataset by Varambally et al. comparing processed data provided by ArrayExpress with the reprocessed data.
| Pathway | Z Score (ArrayExpress) | Z Score (Standardized processing) |
|
|
|
|
|
|
|
|
| Glucuronidation |
| 0.62 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| EGFR1 Signaling Pathway |
| 1.80 |
|
|
|
|
|
|
|
|
| IL-3 Signaling Pathway |
| 1.16 |
| Wnt Signaling Pathway NetPath |
| 1.55 |
| IL-7 Signaling Pathway |
| 1.40 |
| FAS pathway and Stress induction of HSP regulation |
| 0.48 |
|
|
|
|
| DNA damage response (only ATM dependent) |
| 1.56 |
|
|
|
|
|
|
|
|
| p38 MAPK Signaling Pathway (BioCarta) |
| 0.66 |
| Senescence and Autophagy |
| 1.72 |
|
|
|
|
| Insulin Signaling |
| 0.38 |
|
|
|
|
|
|
|
|
| Focal Adhesion |
| 0.71 |
| Endochondral Ossification | 1.69 |
|
| Nucleotide Metabolism | 1.68 |
|
| Myometrial Relaxation and Contraction Pathways | 1.55 |
|
| DNA Replication | 1.27 |
|
| One Carbon Metabolism | 0.88 |
|
| Angiogenesis | 0.68 |
|
Pathway analysis is based on a comparison between primary prostate cancer and metastatic prostate cancer. Only significant pathways with a Z-score >1.9 in at least one of the two analyses are included. Significant Z-scores are depicted in bold; matches in pathways between the analyses are in italics.
PathVisio results of significant pathways found in the dataset by Wallace et al. comparing processed data provided by ArrayExpress with the reprocessed data after quality control.
| Pathway | Z Score (ArrayExpress) | Z Score (Standardized processing) |
| Cytoplasmic Ribosomal Proteins |
| 0.17 |
|
|
|
|
| EGFR1 Signaling Pathway |
| 1.35 |
|
|
|
|
| TNF-alpha/NF-kB Signaling Pathway |
| 1.05 |
|
|
|
|
| Signaling of Hepatocyte Growth Factor Receptor |
| 0.82 |
| Androgen Receptor Signaling Pathway |
| 0.57 |
|
|
|
|
| IL-9 Signaling Pathway |
| −0.34 |
| T Cell Receptor Signaling Pathway |
| 0.37 |
| Non-homologous end joining |
| 1.36 |
| Proteasome Degradation |
| 1.02 |
| Serotonin Receptor 4/6/7 NR3C signaling |
| −0.49 |
| Notch Signaling Pathway |
| −0.89 |
| Fatty Acid Biosynthesis |
| −0.34 |
| Insulin Signaling | 1.49 |
|
Pathway analysis is based on a comparison between normal prostate tissue and prostatic adenocarcinoma. Only significant pathways with a Z-score >1.9 in at least one of the two analyses are included. Significant Z-scores are depicted in bold; matches between the analyses are in italics.
PathVisio results of significant pathways found in the dataset by Sun et al. comparing processed data provided by ArrayExpress with the reprocessed data after quality control.
| Pathway | Z Score (ArrayExpress) | Z Score (Standardized processing) |
| miRNAs involved in DDR |
| −1.22 |
| Angiogenesis |
| 0.22 |
| IL-2 Signaling Pathway |
| −0.94 |
| FAS pathway and Stress induction of HSP regulation |
| −0.66 |
| T Cell Receptor Signaling Pathway |
| 1.27 |
| p38 MAPK Signaling Pathway (BioCarta) |
| −1.81 |
| B Cell Receptor Signaling Pathway |
| 0.20 |
| Serotonin Receptor 4/6/7 NR3C signaling |
| −0.59 |
| IL-5 Signaling Pathway |
| −0.69 |
| G1 to S cell cycle control |
| −2.49 |
| Cell cycle |
| NaN |
| TCA Cycle |
| NaN |
| Type II interferon signaling (IFNG) |
| −0.07 |
| DNA damage response |
| −1.08 |
| GPCRs, Class B Secretin-like | 0.17 |
|
| Inflammatory Response Pathway | 0.13 |
|
| Cholesterol Biosynthesis | −0.99 |
|
Pathway analysis is based on a comparison between recurrent and non-recurrent prostate cancer. Only significant pathways with a Z-score >1.9 in at least one of the two analyses are included. Significant Z-scores are depicted in bold. A NaN value commonly occurs when none of the genes in the pathway is present in the dataset.
PathVisio results of significant pathways found in the dataset by Best et al. comparing processed data provided by ArrayExpress with the reprocessed data after quality control.
| Pathway | Z Score (ArrayExpress) | Z Score (Standardized processing) |
|
|
|
|
| Catalytic cycle of mammalian FMOs |
| 0.44 |
|
|
|
|
|
|
|
|
| IL-1 Signaling Pathway |
| −0.41 |
| Focal Adhesion |
| 1.00 |
| Nifedipine |
| 0.69 |
| Complement and Coagulation Cascades KEGG |
| 1.20 |
| Selenium metabolism/Selenoproteins | 1.80 |
|
| TGF-beta Receptor Signaling Pathway | 1.30 |
|
| ErbB signaling pathway | 1.02 |
|
| DNA damage response | 0.82 |
|
| Translation Factors | 0.77 |
|
| Blood Clotting Cascade | 0.32 |
|
| Oxidative Stress | 0.11 |
|
Pathway analysis is based on a comparison between androgen-dependent and androgen-independent prostate cancer. Only significant pathways with a Z-score >1.9 in at least one of the two analyses are included. Significant Z-scores are depicted in bold; matches between the analyses are in italics.