| Literature DB >> 20860844 |
Fan Shi1, Christopher Leckie, Geoff MacIntyre, Izhak Haviv, Alex Boussioutas, Adam Kowalczyk.
Abstract
BACKGROUND: In the study of cancer genomics, gene expression microarrays, which measure thousands of genes in a single assay, provide abundant information for the investigation of interesting genes or biological pathways. However, in order to analyze the large number of noisy measurements in microarrays, effective and efficient bioinformatics techniques are needed to identify the associations between genes and relevant phenotypes. Moreover, systematic tests are needed to validate the statistical and biological significance of those discoveries.Entities:
Mesh:
Year: 2010 PMID: 20860844 PMCID: PMC2949898 DOI: 10.1186/1471-2105-11-477
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Super-biclusters in gastric cancer dataset
| #Converging | P-values | Most significant biological process | ||||
|---|---|---|---|---|---|---|
| SBC | SBC | Prototype | MCS | Malignancy Score | GO | |
| SBC1 | 11 | 6 | 9.4 × 10-4 | 1.8 × 10-13 | 5.1 × 10-9 | epidermis development |
| SBC2 | 188 | 7 | 1.0 × 10-8 | 7.1 × 10-7 | lipid metabolic process | |
| SBC3 | 2 | 1 | 1.5 × 10-6 | 5.5 × 10-8 | 3.2 × 10-32 | immune system process |
| SBC4 | 96 | 2 | 1.8 × 10-1 | 2.0 × 10-53 | immune system process | |
| SBC5 | 15 | 15 | 1.1 × 10-18 | 7.7 × 10-21 | 1.8 × 10-14 | cell cycle process |
| SBC6 | 328 | 11 | 3.0 × 10-7 | 4.9 × 10-8 | 1.8 × 10-20 | multicellular organismal process |
| SBC7 | 359 | 229 | 4.0 × 10-14 | -5.4 × 10-22 | 3.2 × 10-22 | gen. of precursor metabolic & energy |
| SBC8 | 1 | 1 | 3.0 × 10-10 | -5.2 × 10-8 | 2.2 × 10-2 | lipid metabolic process |
Numerical characterisations and biological relevance of the eight super-biclusters generated by BOA on the gastric cancer data. In the second column of the table, the numbers of biclusters that converged to a particular super-bicluster are given, while the third column is the number of identical biclusters converging to the prototype of that super-bicluster. The columns of "MCS", "Malignancy Score" and "GO" contain the p-values calculated with respect to the prototype of each super-bicluster in terms of the three statistics described in Section 2.3. Note that the negative sign, '-', in the Malignancy Score for SBC7 and SBC8 indicates the significance of agreement with the reverse order.
Figure 1Heat map of super-bicluster 7. Heat map for the prototype of the most prominent super-bicluster, SBC7, generated by the BOA algorithm for the gastric cancer data. The vertical axis shows the 515 most significant genes ordered by f(g) in Algorithm 1, and cut by θ= 5.0, while the horizontal axis shows all samples ordered by h(s) and cut by θ= 4.5. The yellow vertical line in the middle of figure indicates the boundary between the samples in the bicluster (left-side) and others (right-side). The bicluster samples are enriched with the CG subtype with a p-value of 4.32 × 10-10 in terms of the SCS metric or enriched with a combination of {normal, CG, IM} subtypes with a p-value of 4.03 × 10-14 in terms of the MCS metric. Moreover, we observe a strong gradation from least malignant samples (normal and CG), through an intermediate phenotype IM, to the malignant samples (combined intestinal, diffuse and mixed gastric cancers). Two phenotypes, squamous and adenosquamous, with only one sample are annotated with black and white, respectively, but are not shown on the legend. The probability of obtaining such or better ordering by random chance was estimated to have a p-value of 5.35 × 10-22 in terms of Jonckheere's test.
Over-represented GO terms in gastric cancer dataset
| ID | P-value | Biological process |
|---|---|---|
| GO:0006091 | 3.24 × 10-22 | generation of precursor metabolites and energy |
| GO:0006119 | 3.68 × 10-18 | oxidative phosphorylation |
| GO:0006118 | 3.48 × 10-12 | electron transport |
| GO:0042773 | 8.12 × 10-12 | oxidative phosphorylation#ATP synthesis coupled electron transport |
| GO:0042775 | 8.12 × 10-12 | organelle ATP synthesis coupled electron transport |
The five most significantly over-represented GO terms associated with the genes of the prototype of SBC7.
The results are generated from GOstat [16].
Figure 2Saturation metrics for gastric cancer dataset. Gastric cancer benchmark results for five biclustering algorithms. We plot the number of unique biclusters (solid lines) and super-biclusters (broken lines) with the p-value below the threshold indicated by the x-axis. Each algorithm is represented with a unique color as shown in the legend. The results for the super-biclusters are represented with the same color as the biclusters for BOA, ISA and Gibbs (broken lines). Note that Gibbs produces exactly the same lines for biclusters and super-biclusters due to their algorithm. We have used the SCS (left sub- figure) and MCS (right sub- figure) metrics to calculate the p-values. We have applied 1000 random initializations for BOA and ISA and the parameter settings follow the suggestions in these studies.
Figure 3Saturation metrics for lymphoma dataset. Lymphoma dataset benchmark results for five biclustering algorithms. The experimental settings and elements of these figures are the same as the gastric cancer experiments.
Comparison with previous literature
| SBC1 | SBC2 | SBC3 | SBC4 | SBC5 | SBC6 | SBC7 | SBC8 | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Symbol | Annotation | No. Genes | 41 | 217 | 194 | 158 | 227 | 409 | 515 | 146 |
| B | Mitochondrial | 665 | 0 | 0 | 0 | 0 | 0 | 1 | 416 | 9 |
| D1-D3 | Proliferation | 201 | 0 | 0 | 0 | 0 | 76 | 0 | 0 | 0 |
| E | Intestinal | 294 | 1 | 81 | 0 | 0 | 0 | 0 | 1 | 44 |
| F | Intestinal | 157 | 0 | 112 | 0 | 0 | 7 | 1 | 0 | 27 |
| G | Squamous | 37 | 25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| H | Inflamation | 330 | 7 | 0 | 117 | 135 | 9 | 7 | 0 | 30 |
| K | Extra cellular matrix | 877 | 3 | 0 | 67 | 0 | 74 | 392 | 1 | 0 |
Overlapping genes between prototypes of super-biclusters and functional regions in [1]. In the second row we show the number of genes in the SBC prototype.