| Literature DB >> 22759615 |
Junwan Liu1, Zhoujun Li, Xiaohua Hu, Yiming Chen, Feifei Liu.
Abstract
BACKGROUND: Multi-objective optimization (MOO) involves optimization problems with multiple objectives. Generally, theose objectives is used to estimate very different aspects of the solutions, and these aspects are often in conflict with each other. MOO first gets a Pareto set, and then looks for both commonality and systematic variations across the set. For the large-scale data sets, heuristic search algorithms such as EA combined with MOO techniques are ideal. Newly DNA microarray technology may study the transcriptional response of a complete genome to different experimental conditions and yield a lot of large-scale datasets. Biclustering technique can simultaneously cluster rows and columns of a dataset, and hlep to extract more accurate information from those datasets. Biclustering need optimize several conflicting objectives, and can be solved with MOO methods. As a heuristics-based optimization approach, the particle swarm optimization (PSO) simulate the movements of a bird flock finding food. The shuffled frog-leaping algorithm (SFL) is a population-based cooperative search metaphor combining the benefits of the local search of PSO and the global shuffled of information of the complex evolution technique. SFL is used to solve the optimization problems of the large-scale datasets.Entities:
Mesh:
Year: 2012 PMID: 22759615 PMCID: PMC3394423 DOI: 10.1186/1471-2164-13-S3-S6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1An individual encoding a bicluster. Figure 1 presents the individual encoding a bicluster with 2 genes and 3 conditions, and its size is 2 × 3 = 6.
Information of biclusters found on yeast dataset
| Bicluster | Genes | Conditions | Residue | Row variance |
|---|---|---|---|---|
| 1 | 101 | 15 | 215.62 | 749.17 |
| 6 | 514 | 10 | 289.65 | 955.25 |
| 14 | 858 | 10 | 322.58 | 702.36 |
| 22 | 478 | 11 | 298.68 | 885.64 |
| 31 | 123 | 12 | 201.88 | 699.87 |
| 36 | 801 | 8 | 221.88 | 687.18 |
| 44 | 1125 | 13 | 236.47 | 598.68 |
| 56 | 847 | 11 | 208.48 | 748.54 |
| 75 | 546 | 9 | 250.14 | 664.13 |
| 89 | 89 | 17 | 210.88 | 666.57 |
Table 1 shows the number of genes and conditions, the mean squared residue and the row variance of ten biclusters out of the one hundred biclusters found on the yeast dataset.
Figure 2Small biclusters of size 24 × 17 on the yeast dataset. Figure 2 shows the expression value of 24 genes under 17 conditions from the small biclusters (bicluster 63).
Biclusters found on human dataset
| Bicluster | Genes | Conditions | Residue | Row variance |
|---|---|---|---|---|
| 1 | 882 | 34 | 987.54 | 3587.26 |
| 4 | 666 | 54 | 1087.25 | 4201.36 |
| 11 | 1024 | 36 | 773.69 | 2930.64 |
| 17 | 1102 | 39 | 1204.65 | 3698.84 |
| 24 | 968 | 37 | 1110.25 | 3548.45 |
| 35 | 805 | 41 | 844.44 | 2987.01 |
| 39 | 871 | 48 | 2874.17 | 2140.36 |
| 44 | 1208 | 29 | 885.74 | 3587.45 |
| 59 | 258 | 86 | 777.58 | 2874.94 |
| 88 | 1508 | 59 | 1405 | 6658.45 |
Table 2 shows the number of genes and conditions, the mean squared residue and the row variance of ten biclusters out of the one hundred biclusters found on the human dataset.
Comparative study of three algorithms
| MOPSOB | MOSFLB | DMOPSOB | MODPSFLB | |||||
|---|---|---|---|---|---|---|---|---|
| Avg. MSR | 218.54 | 927.47 | 215.98 | 913.53 | 216.13 | 905.23 | 212.8 | 904.9 |
| Avg. size | 10510.8 | 34012.24 | 1109.23 | 35507.22 | 11213.5 | 35442.98 | 11220.7 | 35601.8 |
| Avg. genes | 1102.84 | 902.41 | 1148.21 | 928.12 | 1151.25 | 932.57 | 1154.21 | 933.9 |
| Avg. conditions | 9.31 | 40.12 | 9.78 | 43.11 | 9.59 | 42.78 | 9.81 | 43029 |
| Max size | 15613 | 37666 | 15709 | 37871 | 14770 | 37231 | 14827 | 37486 |
| Avg. time | 120.78 | 328.56 | 111.41 | 319.88 | 100.47 | 310.34 | 88.24 | 287.98 |
Table 3 compares the performance of two algorithms. It gives the average of mean squared residue and the average size of the found biclusters, and gives computation cost of two algorithms.
Significant GO terms of genes in three biclusters
| No. of genes | Process | Function | Component | |
|---|---|---|---|---|
| 1 | 101 | Lipid transport (n = 21, p = 0.00389) | Oxidoreductase activity | Membrane |
| 12 | 71 | Physiological process | MAP kinase activity | Cell |
| 33 | 58 | Protein biosynthesis | Structural constituent of ribosome | Cytosolic ribosome |
Table 4 lists the significant shared GO terms which are used to describe genes in each bicluster for the process, function and component ontology.