| Literature DB >> 21541025 |
Jesús Lascorz1, Bowang Chen, Kari Hemminki, Asta Försti.
Abstract
BACKGROUND: A large number of gene expression profiling (GEP) studies on prognosis of colorectal cancer (CRC) has been performed, but no reliable gene signature for prediction of CRC prognosis has been found. Bioinformatic enrichment tools are a powerful approach to identify biological processes in high-throughput data analysis. PRINCIPALEntities:
Mesh:
Year: 2011 PMID: 21541025 PMCID: PMC3081819 DOI: 10.1371/journal.pone.0018867
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Gene expression profiling studies on CRC prognosis included in the present study.
| First author | Ref. | Year | Platform | Samples | Study design | Reported gene identifiers (IDs) | Unique, mapped genes |
| Agrawal |
| 2002 | Affymetrix U95A | 60 p.t. | A | 107 | 96 |
| Arango |
| 2005 | Affymetrix U133A | 25 p.t. | B | 234 | 220 |
| Bandres |
| 2007 | Oligo array | 16 p.t. | A | 8 | 6 |
| Barrier (1) |
| 2005 | Affymetrix U133A | 12 p.t.+a.m. | B | 47 | 34 |
| Barrier (2) |
| 2005 | Affymetrix U133A | 25 p.t.+a.m. | B | 100 | 94 |
| Barrier (3) |
| 2006 | Affymetrix U133A | 50 p.t. | B | 30 | 20 |
| Barrier (4) |
| 2007 | Affymetrix U133A | 24 p.t.+a.m. | B | 70 | 63 |
| Bertucci |
| 2004 | cDNA array | 22 p.t.+a.m. | A | 290 | 234 |
| Cavalieri |
| 2007 | Agilent 1A | 19 p.t. | C | 8 | 8 |
| D'Arrigo |
| 2005 | cDNA array | 20 p.t. | A | 29 | 19 |
| Eschrich |
| 2005 | cDNA array | 78 p.t. | C | 43 | 26 |
| Fritzmann |
| 2009 | Affymetrix U95A | 41 p.t., 25 m. | D | 121 | 115 |
| Garman |
| 2008 | Affym. U95A/133A | 52 p.t. | B | 50 | 45 |
| Grade |
| 2007 | Oligo array | 73 p.t. | A | 68 | 66 |
| Jiang |
| 2008 | Affymetrix U133A | 123 p.t. | B | 7 | 7 |
| Jorissen |
| 2009 | Affym. U133Plus | 293 p.t. | D | 128 | 116 |
| Ki |
| 2007 | cDNA array | 23 p.t.+m. | D | 46 | 43 |
| Kleivi |
| 2007 | Agilent 1A | 18 p.t., 4 m. | D | 40 | 40 |
| Komuro |
| 2005 | cDNA array | 89 p.t. | A | 62 | 60 |
| Kwon |
| 2004 | cDNA array | 12 p.t. | A | 60 | 53 |
| Liersch |
| 2009 | cDNA array | 30 p.t. | B | 20 | 15 |
| Lin |
| 2007 | Oligo array+Affym. | 204 p.t. | B | 35 | 32 |
| Smith |
| 2009 | Affym. U133Plus | 55 p.t. | E | 34 | 34 |
| Wang |
| 2004 | Affymetrix U133A | 74 p.t. | B | 23 | 20 |
| Watanabe (2) |
| 2009 | Affym. U133Plus | 36 p.t. | B | 45 | 30 |
| Watanabe (1) |
| 2009 | Affymetrix U133A | 89 p.t. | A | 73 | 57 |
| Yamasaki |
| 2007 | cDNA array | 32 p.t., 32 m. | D | 119 | 82 |
*Number of unique, annotated mapped genes obtained by converting the originally reported gene identifiers (IDs) in each single study to the official HUGO gene symbol. p.t., primary tumours; a.m., adjacent mucosa; m., metastasis; A, metastasis yes/no; B, recurrence yes/no; C, survival; D, metastasis vs. primary tumours; E, prognosis good/bad.
Number of overrepresented GO and KEGG categories in the three gene lists for each of the tools used.
| 54 gene list | 124 gene list | 1475 gene list | |||||||
| Tool name | BP | MF | KEGG | BP | MF | KEGG | BP | MF | KEGG |
| ConsensusPathDB | n.a. | n.a. | 2 | n.a. | n.a. | 2 | n.a. | n.a. | 1 |
| DAVID | 0 | 0 | 0 | 0 | 1 | 0 | 95 | 13 | 5 |
| FatiGO | 1 | 0 | n.a. | 0 | 6 | n.a. | 53 | 4 | n.a. |
| GATHER | 0 | n.a. | 0 | 1 | n.a. | 0 | 11 | n.a. | 1 |
| GeneCodis | 26 | 17 | 8 | 54 | 35 | 21 | 115 | 80 | 116 |
| GOTM | 10 | 10 | n.a. | 10 | 10 | n.a. | 10 | 10 | n.a. |
| g:Profiler | 9 | 0 | 0 | 16 | 1 | 1 | 181 | 18 | 4 |
| Ingenuity | 77 | 0 | 69 | 1 | 61 | 159 | |||
| ToppFun | 35 | 3 | 0 | 29 | 17 | 1 | 234 | 34 | 10 |
| WebGestalt | 40 | 12 | 13 | 40 | 25 | 34 | 40 | 40 | 136 |
Only categories significantly associated after correction for multiple testing (P value<0.05) is shown. BP, Gene Ontology Biological Process; MF, Gene Ontology Molecular Function; KEGG, Kyoto Encyclopedia of Genes and Genomes. n.a., database not applicable.
*Results of the enrichment analysis using the Ingenuity software have to be considered separately, since the software makes use of its own databases, Top Bio Function and Top Canonical Pathways.
Consistently enriched GO and KEGG categories.
| ID | Category | Number of genes in category | 54 gene list | 124 gene list | 1475 gene list | |||
| GO Biological Process (8 tools) | Tools | Genes | Tools | Genes | Tools | Genes | ||
|
|
| 1166 | 0 | - | 4 | 22 | 8 | 175 |
|
|
| 2252 | 3 | 18 | 3 | 31 | 5 | 258 |
|
|
| 2050 | 3 | 18 | 3 | 30 | 5 | 243 |
|
|
| 892 | 2 | 10 | 3 | 18 | 5 | 129 |
|
|
| 853 | 0 | - | 3 | 17 | 6 | 121 |
|
|
| 1520 | 1 | 13 | 4 | 31 | 6 | 250 |
|
| ||||||||
|
|
| 93 | 0 | - | 3 | 6 | 6 | 29 |
|
|
| 216 | 0 | - | 3 | 6 | 5 | 36 |
|
|
| 172 | 0 | - | 3 | 6 | 6 | 30 |
|
|
| 8184 | 3 | 38 | 5 | 80 | 6 | 843 |
|
|
| 113 | 4 | 4 | 1 | 4 | 7 | 30 |
|
| ||||||||
|
|
| 58 | 2 | 2 | 3 | 5 | 4 | 24 |
|
|
| 135 | 2 | 3 | 2 | 5 | 5 | 49 |
|
|
| 143 | 0 | - | 2 | 6 | 4 | 41 |
|
|
| 203 | 1 | 2 | 5 | 7 | 4 | 33 |
|
|
| 241 | 2 | 4 | 2 | 8 | 4 | 67 |
|
|
| 109 | 0 | - | 2 | 5 | 4 | 34 |
|
|
| 65 | 0 | - | 2 | 3 | 4 | 23 |
In each case, the number of enrichment tools reporting the category as significantly overrepresented and the maximal number of genes from the category present in the input gene list are indicated.
Figure 1Bar chart of enrichment ratios for GO and KEGG categories in the three gene lists (54, 124, and 1475 genes).
Ratio of enrichment = number of observed divided by the number of expected genes from each GO or KEGG category in the gene list (according to WebGestalt or, alternatively, DAVID or GOTM tools). GO BP, Gene Ontology Biological Process; GO MF, Gene Ontology Molecular Function; KEGG, Kyoto Encyclopedia of Genes and Genomes.
Overlap of the genes from the consistently enriched GO and KEGG categories in the GEP studies on prognosis of CRC.
| ID | Category | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o | p | q | r | ||
| Number genes | 22 | 31 | 30 | 18 | 17 | 31 | 6 | 6 | 6 | 80 | 4 | 5 | 5 | 6 | 7 | 8 | 5 | 3 | |||
|
|
|
|
| 13 | 13 | 9 | 17 | 11 | - | - | - | 20 | 2 | 1 | 1 | - | 5 | - | 2 | ||
|
|
|
|
| 29 | 17 | 14 | 16 | - | - | - | 29 | 3 | 2 | 1 | 1 | - | 6 | - | 2 | ||
|
|
|
|
| 15 | 12 | 16 | - | - | - | 27 | 3 | 2 | 1 | 1 | - | 5 | - | 2 | |||
|
|
|
|
| 7 | 9 | - | - | - | 18 | 2 | 1 | 1 | 1 | - | 4 | - | 2 | ||||
|
|
|
|
| 9 | - | - | - | 16 | 1 | 1 | - | 1 | - | 5 | - | 2 | |||||
|
|
|
|
| 1 | 1 | 1 | 26 | 2 | 2 | 2 | 2 | 1 | 5 | 1 | 1 | ||||||
|
|
|
|
| 6 | 6 | 1 | - | - | - | 4 | 6 | - | 4 | - | |||||||
|
|
|
|
| 6 | 3 | - | - | - | 4 | 6 | - | 4 | - | ||||||||
|
|
|
|
| 3 | - | - | - | 4 | 6 | - | 4 | - | |||||||||
|
|
|
|
| 4 | 5 | 5 | 1 | 1 | 6 | 2 | |||||||||||
|
|
|
|
| - | - | - | - | - | - | - | |||||||||||
|
|
|
|
| 4 | - | - | 1 | - | 1 | ||||||||||||
|
|
|
|
| - | - | 1 | - | 1 | |||||||||||||
|
|
|
|
| 5 | - | 5 | - | ||||||||||||||
|
|
|
|
| - | 5 | - | |||||||||||||||
|
|
|
|
| - | 2 | ||||||||||||||||
|
|
|
|
| - | |||||||||||||||||
|
|
|
|
|
The number of genes from the 124 gene list belonging to each category is indicated, as well as the number of overlapping genes between each pair of categories. The three biologically meaningful category groups identified are marked:
, cell proliferation and apoptosis;
*, oxidative phosphorylation;
, extracellular matrix receptor interaction.