| Literature DB >> 21269471 |
Michael A Thomas1, Luobin Yang, Barbara J Carter, Rebecca D Klaper.
Abstract
BACKGROUND: Methods for gene-class testing, such as Gene Set Enrichment Analysis (GSEA), incorporate biological knowledge into the analysis and interpretation of microarray data by comparing gene expression patterns to pathways, systems and emergent phenotypes. However, to use GSEA to its full capability with non-mammalian model organisms, a microarray platform must be annotated with human gene symbols. Doing so enables the ability to relate a model organism's gene expression, in response to a given treatment, to potential human health consequences of that treatment. We enhanced the annotation of a microarray platform from a non-mammalian model organism, and then used the GSEA approach in a reanalysis of a study examining the biological significance of acute and chronic methylmercury exposure on liver tissue of fathead minnow (Pimephales promelas). Using GSEA, we tested the hypothesis that fathead livers, in response to methylmercury exposure, would exhibit gene expression patterns similar to diseased human livers.Entities:
Mesh:
Year: 2011 PMID: 21269471 PMCID: PMC3037904 DOI: 10.1186/1471-2164-12-66
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
GSEA tests for GO classes enriched following acute treatment
| GO term | GO definition | Size | ES | NES | NOM p-value |
|---|---|---|---|---|---|
| GO:0042254 | 14 | 0.708 | 1.591 | 0.000 | |
| GO:0006364 | 11 | 0.717 | 1.510 | 0.000 | |
| GO:0015031 | 122 | 0.437 | 1.614 | 0.014 | |
| GO:0006754 | 56 | 0.438 | 1.360 | 0.014 | |
| GO:0006486 | Protein amino acid glycosylation | 75 | 0.378 | 1.280 | 0.133 |
| GO:0006605 | 85 | 0.436 | 1.557 | 0.013 | |
| GO:0009058 | Biosynthesis | 327 | 0.324 | 1.315 | 0.149 |
| GO:0045045 | Secretory pathway | 59 | 0.422 | 1.448 | 0.064 |
| GO:0007046 | Ribosome biogenesis | 94 | 0.552 | 1.553 | 0.054 |
| GO:0006457 | Protein folding | 44 | 0.466 | 1.387 | 0.086 |
| GO:0006633 | Fatty acid biosynthesis | 11 | 0.411 | 1.162 | 0.296 |
| GO:0006888 | ER to golgi vesicle mediated transport | 11 | 0.365 | 0.864 | 0.676 |
| GO:0009165 | Nucleotide biosynthesis | 12 | -0.379 | -1.047 | 0.458 |
| 10 | -0.647 | -1.610 | 0.000 | ||
Results for gene sets representing each GO-BP class identified by Klaper et al. [12,13], tested for enrichment by GSEA following acute methylmercury treatment. GO terms and definitions can be used to retrieve each set (and constituent genes) from MSigDB [8]; size refers to the number of genes in the set; ES and NES are the enrichment scores and normalized enrichment scores (respectively) for the set; NOM p-value is the nominal p-value for the NES. Gene sets representing GO-BP classes with statistically significant normalized enrichment scores are highlighted in bold.
GSEA tests for GO classes enriched following chronic treatment
| GO term | GO definition | Size | ES | NES | NOM p-value |
|---|---|---|---|---|---|
| GO:0006605 | Protein targeting | 85 | 0.387 | 1.416 | 0.054 |
| GO:0016485 | Protein processing | 32 | 0.413 | 1.267 | 0.086 |
| GO:0006897 | Endocytosis | 162 | 0.253 | 1.091 | 0.278 |
| GO:0000226 | Microtubule cytoskeleton organization and biogenesis | 28 | 0.376 | 1.004 | 0.484 |
| GO:0007409 | Axonogenesis | 29 | 0.362 | 0.992 | 0.614 |
| GO:0045893 | Positive regulation of transcription, DNA dependent | 270 | 0.200 | 0.898 | 0.502 |
| GO:0006887 | Exocytosis | 15 | 0.402 | 0.856 | 0.763 |
| GO:0006289 | Nucleotide-excision repair | 14 | 0.287 | 0.842 | 0.806 |
| GO:0007269 | Neurotransmitter secretion | 10 | 0.302 | 0.702 | 0.786 |
| GO:0006917 | Induction of apoptosis | 208 | -0.286 | -1.093 | 0.396 |
| GO:0009615 | Response to virus | 21 | -0.301 | -1.198 | 0.130 |
| GO:0001756 | Somitogenesis | 17 | -0.356 | -1.020 | 0.502 |
Results for each GO-BP class identified by Klaper et al. [12,13], tested for enrichment by GSEA following chronic methylmercury treatment. Size refers to the number of genes in the set; ES and NES are the enrichment scores and normalized enrichment scores (respectively) for the set; NOM p-value is the nominal p-value for the NES.
GSEA tests for enrichment of MSigDB C2 gene sets
| Gene Set Name | Size | ES | NES | NOM p-value | FDR q-value | ||
|---|---|---|---|---|---|---|---|
| Acute exposure | Down-regulated | None | |||||
| Up-regulated | XU_CBP_UP | 19 | 0.633 | 1.987 | 0.000 | 0.051 | |
| CALRES_MOUSE_DN | 28 | 0.575 | 1.920 | 0.000 | 0.108 | ||
| BYSTRYKH_HSC_CIS_GLOCUS | 82 | 0.516 | 1.889 | 0.000 | 0.097 | ||
| BYSTRYKH_HSC_BRAIN_CIS_GLOCUS | 43 | 0.600 | 1.855 | 0.000 | 0.102 | ||
| PENG_GLUTAMINE_DN | 209 | 0.519 | 1.828 | 0.000 | 0.130 | ||
| UVC_LOW_ALL_DN | 38 | 0.473 | 1.817 | 0.000 | 0.121 | ||
| CMV_ALL_UP | 64 | 0.530 | 1.764 | 0.000 | 0.214 | ||
| SANSOM_APC_LOSS5_UP | 55 | 0.544 | 1.734 | 0.037 | 0.274 | ||
| BLEO_MOUSE_LYMPH_HIGH_24HRS_DN | 31 | 0.669 | 1.730 | 0.000 | 0.255 | ||
| SHEPARD_CRASH_AND_BURN_MUT_VS_WT_UP | 124 | 0.452 | 1.710 | 0.000 | 0.289 | ||
| PENG_RAPAMYCIN_DN | 163 | 0.516 | 1.706 | 0.023 | 0.275 | ||
| PRMT5_KD_DN | 19 | 0.572 | 1.694 | 0.038 | 0.298 | ||
| HSA00970_AMINOACYL_TRNA_BIOSYNTHESIS | 29 | 0.702 | 1.690 | 0.029 | 0.286 | ||
| MRNA_SPLICING | 44 | 0.633 | 1.689 | 0.000 | 0.266 | ||
| WANG_MLL_CBP_VS_GMP_DN | 30 | 0.579 | 1.688 | 0.000 | 0.254 | ||
| JAIN_NEMO_DIFF | 59 | 0.458 | 1.688 | 0.000 | 0.240 | ||
| HSA00563_GLYCOSYLPHOSPHATIDYLINOSITOL_ANCHOR_BIOSYNTHESIS | 16 | 0.616 | 1.684 | 0.000 | 0.239 | ||
| BLEO_MOUSE_LYMPH_LOW_24HRS_DN | 22 | 0.697 | 1.681 | 0.000 | 0.238 | ||
| OLDONLY_FIBRO_UP | 28 | 0.485 | 1.674 | 0.000 | 0.243 | ||
| CMV_24HRS_UP | 54 | 0.524 | 1.673 | 0.000 | 0.232 | ||
| KUROKAWA_5FU_IFN_SENSITIVE_VS_RESISTANT_DN | 22 | 0.624 | 1.669 | 0.000 | 0.232 | ||
| CHANG_SERUM_RESPONSE_UP | 109 | 0.480 | 1.668 | 0.000 | 0.224 | ||
| ROS_MOUSE_AORTA_UP | 21 | 0.731 | 1.656 | 0.036 | 0.237 | ||
| UVB_NHEK2_DN | 64 | 0.456 | 1.649 | 0.000 | 0.242 | ||
| SCHUMACHER_MYC_UP | 44 | 0.504 | 1.649 | 0.023 | 0.234 | ||
| STEMCELL_COMMON_UP | 149 | 0.458 | 1.643 | 0.023 | 0.242 | ||
| MYC_ONCOGENIC_SIGNATURE | 137 | 0.406 | 1.641 | 0.000 | 0.239 | ||
| ZHAN_MMPC_SIM_BC_AND_MM | 36 | 0.472 | 1.635 | 0.000 | 0.242 | ||
| AMINOACYL_TRNA_BIOSYNTHESIS | 21 | 0.732 | 1.633 | 0.048 | 0.238 | ||
| Chronic exposure | Down-regulated | TNFR1PATHWAY | 24 | -0.724 | -1.968 | 0.000 | 0.026 |
| CASPASEPATHWAY | 16 | -0.684 | -1.865 | 0.000 | 0.049 | ||
| DEATHPATHWAY | 25 | -0.719 | -1.836 | 0.000 | 0.063 | ||
| HIVNEFPATHWAY | 44 | -0.593 | -1.765 | 0.000 | 0.104 | ||
| MITOCHONDRIAPATHWAY | 17 | -0.785 | -1.728 | 0.000 | 0.151 | ||
| TSA_PANC50_UP | 25 | -0.584 | -1.684 | 0.000 | 0.216 | ||
| Up-regulated | None |
Results for MSigDB C2 gene sets (drawn from 1892 sets representing all systems, pathways and functions in that database), listing sets with NES scores that are both statistically significant and below the FDR threshold. Size refers to the number of genes in the set; ES and NES are the enrichment scores and normalized enrichment scores (respectively) for the set; NOM p-value is the nominal p-value for the NES, FDR q-vlaue is the false discovery rate ratio (values lower than 0.25 are considered to be passing as per Subramanian et al. [4]). One of these sets (in bold) is a liver associated set highlighted in Table 4.
GSEA tests for enrichment of HCC- and hepatitis-associated gene sets
| Gene Set Name | Size | ES | NES | NOM p-value |
|---|---|---|---|---|
| KUROKAWA_5FU_IFN_SENSITIVE_VS_RESISTANT_DN | 25 | 0.585 | 1.642 | 0.025 |
| WONG_IFNA_HCC_RESISTANT_VS_SENSITIVE_UP | 10 | 0.507 | 1.407 | 0.046 |
| HBX_HEP_UP | 12 | -0.540 | -1.460 | 0.038 |
| HCC_SURVIVAL_GOOD_VS_POOR_DN | 112 | 0.452 | 1.549 | 0.000 |
| None | ||||
Results for gene sets representing Hepatocellular carcinoma (HCC) and hepatitis, tested for enrichment by GSEA following methylmercury treatment. Here, we test whether liver-associated sets are enriched following methylmercury exposure. Size refers to the number of genes in the set; ES and NES are the enrichment scores and normalized enrichment scores (respectively) for the set; NOM p-value is the nominal p-value for the NES. Descriptions of each set (and lists of constituent genes) can be found by searching for set names at the MSigDB [8].