| Literature DB >> 19426457 |
Junwan Liu1, Zhoujun Li, Xiaohua Hu, Yiming Chen.
Abstract
BACKGROUND: High-throughput microarray technologies have generated and accumulated massive amounts of gene expression datasets that contain expression levels of thousands of genes under hundreds of different experimental conditions. The microarray datasets are usually presented in 2D matrices, where rows represent genes and columns represent experimental conditions. The analysis of such datasets can discover local structures composed by sets of genes that show coherent expression patterns under subsets of experimental conditions. It leads to the development of sophisticated algorithms capable of extracting novel and useful knowledge from a biomedical point of view. In the medical domain, these patterns are useful for understanding various diseases, and aid in more accurate diagnosis, prognosis, treatment planning, as well as drug discovery.Entities:
Mesh:
Year: 2009 PMID: 19426457 PMCID: PMC2681067 DOI: 10.1186/1471-2105-10-S4-S9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Information of biclusters found on yeast dataset.
| Bicluster | Genes | Conditions | Residue | Row Variance |
| 1 | 79 | 17 | 205.44 | 711.08 |
| 8 | 101 | 16 | 221.12 | 685.33 |
| 12 | 621 | 11 | 200.11 | 1634.32 |
| 21 | 1156 | 10 | 221.42 | 1385.08 |
| 32 | 543 | 12 | 199.11 | 986.09 |
| 44 | 325 | 15 | 231.04 | 999.55 |
| 53 | 1215 | 13 | 281.82 | 778.73 |
| 69 | 87 | 16 | 209.33 | 1085.22 |
| 81 | 1224 | 8 | 201.77 | 943.45 |
| 88 | 1022 | 9 | 203.89 | 911.75 |
This table shows the number of genes and conditions, the mean squared residue and the row variance of ten biclusters out of the one hundred biclusters found on the yeast dataset.
Figure 1Small biclusters of size 24 × 15 on the yeast dataset. This figure shows the expression value of 24 genes under 15 conditions from the found biclusters.
Biclusters found on human dataset.
| Bicluster | Genes | Conditions | Residue | Row Variance |
| 1 | 1088 | 27 | 895.25 | 3141.25 |
| 11 | 812 | 39 | 774.26 | 2598.36 |
| 14 | 1024 | 32 | 986.74 | 3698.54 |
| 21 | 997 | 38 | 1024.11 | 3014.22 |
| 29 | 741 | 43 | 1078.95 | 2987.84 |
| 39 | 135 | 79 | 1098.76 | 3012.88 |
| 48 | 919 | 41 | 980.66 | 3111.54 |
| 54 | 841 | 72 | 1125.87 | 3987.65 |
| 69 | 298 | 79 | 986.58 | 3897.64 |
| 91 | 871 | 43 | 788.19 | 7843.98 |
This table shows the number of genes and conditions, the mean squared residue and the row variance of ten biclusters out of the one hundred biclusters found on the human dataset.
Comparative study of three algorithms.
| NSGA2B | MOPSOB | CMOPSOB | ||||
| Dataset | Yeast | Human | Yeast | Human | Yeast | Human |
| Avg. MSR | 234.87 | 987.56 | 218.54 | 927.47 | 208.86 | 921.66 |
| Avg. size | 10301.71 | 33463.70 | 10510.8 | 34012.24 | 11085.44 | 36400.58 |
| Avg. genes | 1095.43 | 915.81 | 1102.84 | 902.41 | 1118.41 | 931.11 |
| Avg. conditions | 9.29 | 36.54 | 9.31 | 40.12 | 9.45 | 40.14 |
| Max size | 14828 | 37560 | 15613 | 37666 | 15795 | 37679 |
This table compares the performance of three algorithms, and gives the average of mean squared residue, the average number of genes and conditions, the average size and maximal size of the found biclusters
Significant GO terms of genes in clusters.
| Cluster No. | No. of genes | Process | Function | Component |
| 16 | 96 | Lipid transport | Oxidoreductase activity | membrane |
| 56 | 141 | Cell organization and biogenesis | Protein transporter activity | Nucleus |
| 81 | 1024 | Cellular process | tRNA methyltransferase activity | Cytosolic small ribosomal subunit |
This table lists the significant shared GO terms which are used to describe genes in each bicluster for the process, function and component ontology. Here, only shows the terms whose p-values are smaller than 0.01.