| Literature DB >> 21548942 |
Eva Alloza1, Fátima Al-Shahrour, Juan C Cigudosa, Joaquín Dopazo.
Abstract
BACKGROUND: Recent observations point towards the existence of a large number of neighborhoods composed of functionally-related gene modules that lie together in the genome. This local component in the distribution of the functionality across chromosomes is probably affecting the own chromosomal architecture by limiting the possibilities in which genes can be arranged and distributed across the genome. As a direct consequence of this fact it is therefore presumable that diseases such as cancer, harboring DNA copy number alterations (CNAs), will have a symptomatology strongly dependent on modules of functionally-related genes rather than on a unique "important" gene.Entities:
Mesh:
Year: 2011 PMID: 21548942 PMCID: PMC3112060 DOI: 10.1186/1755-8794-4-37
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Figure 1GO terms significantly associated to chromosomal regions frequently lost in cancers. Since a large number of GO terms are tested, the popular FDR method [34] was used to adjust the p-values for multiple-testing effects obtained when conducting a one-tailed test [14]. GO terms with p-values under a conventional threshold of p < 0.05 were represented as red octagons and GO terms under a more liberal 0.05 < p < 0.15 were represented as blue octagons. White squares represent non-significant terms connecting the significant terms found. The picture has been obtained using the GoGraphViewer utility of the Babelomics package http://www.babelomics.org[44].
GO terms most significantly associated to chromosomal regions frequently lost in cancers.
| GO term | GO ID | Number of genes | p-value | p-value | Cancer feature |
|---|---|---|---|---|---|
| homophilic cell adhesión | GO:0007156 | 114 | 3.8779 × 10-17 | 4.77 × 10-18 | Metastasis and invasion |
| calcium-dependent cell-cell adhesion | GO:0016339 | 22 | 2.0064 × 10-08 | 9.88 × 10-9 | Metastasis and invasion |
| synaptogenesis | GO:0007416 | 30 | 1.2491 × 10-05 | 1.46 × 10-6 | Unrelated |
| sensory perception of taste | GO:0050909 | 18 | 0.00510791 | 0.00299 | Unrelated |
| cellular component organization and biogenesis | GO:0016043 | 2165 | 0.01731139 | 0.0764 | Replicative potential |
| fertilization (sensu Metazoa) | GO:0009566 | 54 | 0.02135039 | 0.0648 | Unrelated |
| sulfate transport | GO:0008272 | 12 | 0.02815271 | 0.00961 | Metastasis and invasion |
| maintenance of fidelity during DNA-dependent DNA replication | GO:0045005 | 28 | 0.02990008 | 0.0141 | Mutability |
| male gamete generation | GO:0048232 | 210 | 0.0449652 | 0.210 | Unrelated |
| localization of cell | GO:0051674 | 410 | 0.06513653 | 0.697 | Metastasis and invasion |
| mismatch repair | GO:0006298 | 27 | 0.09421995 | 0.0113 | Mutability |
| cell cycle | GO:0007049 | 805 | 0.09599886 | 1.00 | Replicative potential |
| pancreatic ribonuclease activity | GO:0004522 | 15 | 0.0046537 | 0.000623 | Metastasis and invasion? |
| serine-type endopeptidase inhibitor activity | GO:0004867 | 89 | 0.0060532 | 0.000767 | Angiogenesis |
| nucleotide binding | GO:0000166 | 1993 | 0.0064466 | 0.00183 | Mutability? |
| sulfate porter activity | GO:0008271 | 10 | 0.0078682 | 0.00279 | Metastasis and invasion |
| interferon-alpha/beta receptor binding | GO:0005132 | 10 | 0.022324 | 0.0180 | Angiogenesis |
| carboxypeptidase A activity | GO:0004182 | 25 | 0.032232 | 0.0156 | Metastasis and invasion? |
| lipoxygenase activity | GO:0016165 | 7 | 0.040851 | 0.0696 | Metastasis and invasion? |
| cytoskeletal part | GO:0044430 | 613 | 0.0000647 | 0.0000153 | Chromosomal instability |
| microtubule organizing center part | GO:0044450 | 26 | 0.014677 | 0.00849 | Chromosomal instability |
| intermediate filament cytoskeleton | GO:0045111 | 153 | 0.014677 | 0.00924 | Chromosomal instability |
| integral to plasma membrane | GO:0005887 | 1180 | 0.042474 | 1.50 × 10-001 | Metastasis and invasion |
Given that many GO terms are tested, the popular FDR and Bonferroni methods were used to adjust for multiple-testing effects the p-values obtained when conducting a one-tailed test (see details in [14]). GO terms with p-values under a conventional threshold of p < 0.05 are listed. GO terms under a more liberal 0.05 < p < 0.15 are listed in Additional file 1 Tables S1, S2 and S3. Rightmost column makes reference to the cancer feature most probably related to the GO term.
GO terms corresponding to the "biological process" ontology significantly associated to chromosomal regions frequently amplified in cancers.
| GO term | GO ID | Number of genes | p-value | p-value |
|---|---|---|---|---|
| defense response to bacterium | GO:0042742 | |||
| mismatch repair | GO:0006298 | |||
| xenobiotic metabolic process | GO:0006805 | 0.169 | ||
| defense response to fungus | GO:0050832 | 0.101 | ||
| telomere maintenance | GO:0000723 | 24 | 0.14336 | 0.303 |
GO terms with p-values under a conventional threshold of p < 0.05 are listed in boldface. Also, GO terms under a more liberal p < 0.15 are listed in normal face. Nominal p-values obtained by conducting the one-tailed test (see details in [14]) were adjusted for multiple testing using the FDR [34] and the more conservative Bonferroni corrections.
GO terms corresponding to the "biological process" ontology significantly associated to chromosomal regions frequently lost in the glioblastomas [30].
| GO term | GO ID | p-value | p-value |
|---|---|---|---|
| sensory perception of chemical stimulus | GO:0007606 | 9.83 × 10-24 | 1.97 × 10-17 |
| sensory perception | GO:0007600 | 2.39 × 10-17 | 7.17 × 10-14 |
| neurological process | GO:0050877 | 4.45 × 10-12 | 1.78 × 10-06 |
| G-protein coupled receptor protein signaling pathway | GO:0007186 | 5.44 × 10-8 | 0.0272 |
| chromatin assembly | GO:0031497 | 1.60 × 10-5 | 0.000799 |
| chromatin assembly or disassembly | GO:0006333 | 1.85 × 10-5 | 0.00011 |
| cell surface receptor linked signal transduction | GO:0007166 | 0.00028 | 0.00169 |
| DNA packaging | GO:0006323 | 0.00204 | 0.01740 |
| establishment and/or maintenance of chromatin architecture | GO:0006325 | 0.00204 | 0.02042 |
| protein-DNA complex assembly | GO:0065004 | 0.00204 | 0.01926 |
| chromosome organization and biogenesis (sensu Eukaryota) | GO:0007001 | 0.00217 | 0.02389 |
| chromosome organization and biogenesis | GO:0051276 | 0.00234 | 0.02819 |
| lipid metabolic process | GO:0006629 | 0.02525 | 0.17675 |
| gas transport | GO:0015669 | 0.02628 | 0.15771 |
| oxygen transport | GO:0015671 | 0.02628 | 0.15771 |
| organelle organization and biogenesis | GO:0006996 | 0.03449 | 0.41390 |
| regulation of liquid surface tension | GO:0050828 | 0.04568 | 0.04568 |
The one-tailed test, as implemented in the GSA version provided by the Babelomics package [44], was used (see details in [14]). Nominal p-values were adjusted for multiple testing using the FDR [34] and the Bonferroni methods.
Figure 2GO terms significantly associated to chromosomal regions frequently lost in glioblastomas [30]. Since a large number of GO terms are tested, the popular FDR method [34] was used to adjust the p-values for multiple-testing effects obtained when conducting a one-tailed test [14]. GO terms with p < 0.05 were represented as octagons. White squares represent non-significant terms connecting the significant terms found. The picture has been obtained using the GoGraphViewer utility of the Babelomics package http://www.babelomics.org[44].
Figure 3Schematic representation of the gene-set enrichment procedure followed. (a) The CNAs are mapped onto the chromosomal coordinates and then used to (b) build up a ranked list of cytobands, which is further used to (c) define a ranking of genes according to how frequently are they involved in CNAs. The distribution of gene modules, in this case the annotations of a fictitious GO, derived from such ranked list is tested for its significant accumulation in frequently lost regions. (d) The variant of the GSA test used [14] seeks for significant asymmetrical distributions of annotations (red bars) with respect to the annotation background (blue bars). The × axis in d represents the number of cases in which a gene has been observed in region affected by a CAN (that, generally speaking can be either amplification or a loss). The bar height represents the number of genes involved in a given frequency of CNA events. The red bars represent the distribution of the genes of a given GO across the frequency of observations of CNAs. The GSA test seeks for GOs whose distributions are significantly skewed towards high or low frequencies of CNAs. Since a large number of GO terms are tested, the FDR method [34] was used to adjust the p-values for multiple-testing effects. The Mitelman database of Chromosome Aberrations in Cancer http://cgap.nci.nih.gov/Chromosomes/Mitelman, May 2007 release, was used as primary source of information. A total of 86048 observations of deletions (corresponding 19859 to regional deletions and 66189 to whole chromosome deletions), and 55935 observations of amplifications (corresponding to 1011 regional amplifications and 54924 whole chromosome amplifications) including any type of cancer, were obtained from the database.