| Literature DB >> 19208127 |
Smitha Dharan1, Achuthsankar S Nair.
Abstract
BACKGROUND: Biclustering algorithms belong to a distinct class of clustering algorithms that perform simultaneous clustering of both rows and columns of the gene expression matrix and can be a very useful analysis tool when some genes have multiple functions and experimental conditions are diverse. Cheng and Church have introduced a measure called mean squared residue score to evaluate the quality of a bicluster and has become one of the most popular measures to search for biclusters. In this paper, we review basic concepts of the metaheuristics Greedy Randomized Adaptive Search Procedure (GRASP)-construction and local search phases and propose a new method which is a variant of GRASP called Reactive Greedy Randomized Adaptive Search Procedure (Reactive GRASP) to detect significant biclusters from large microarray datasets. The method has two major steps. First, high quality bicluster seeds are generated by means of k-means clustering. In the second step, these seeds are grown using the Reactive GRASP, in which the basic parameter that defines the restrictiveness of the candidate list is self-adjusted, depending on the quality of the solutions found previously.Entities:
Mesh:
Year: 2009 PMID: 19208127 PMCID: PMC2648745 DOI: 10.1186/1471-2105-10-S1-S27
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Algorithm of greedy randomized search procedure
| Algorithm GRASP (Seed) | |
| Current = Seed; | |
| While <termination condition not met> do | |
| Solution ← Greedy_Randomized_ Construction (Current); | |
| Solution ← Local_Search (Solution); | |
| Current ← Solution; | |
| End |
Algorithm of greedy randomized construction phase
| Algorithm Greedy_Randomized_Construction(Seed) | |
| Solution ← Seed | |
| Calculate the incremental costs of the elements not included in the current solution | |
| Build the candidate list | |
| While <termination condition not met> do | |
| Build the restricted candidate list (RCL) | |
| Select an element s at random from RCL | |
| Solution ← Solution ∪ s | |
| Recalculate the incremental costs and candidate list | |
| End | |
| Return Solution | |
| End |
Algorithm of local search phase
| Algorithm Local_Search (Solution) | |
| While there exists | |
| Solution ← Solution ∪ | |
| End | |
| Return Solution |
Figure 1Biclusters extracted from the Yeast gene expression data using GRASP. The biclusters are labelled as A, B, C and D. The number of genes, number of conditions, HScore, p-value of the biclusters are (A) (20, 12, 198.70, 2.8479e-020) (B) (12, 11, 197.7, 3.6879e-015) (C) (11, 13, 197.79, 6.0545e-014) (D) (12, 13, 199.9, 2.9778e-009).
Figure 2Biclusters extracted from the Yeast gene expression data using Reactive GRASP. The biclusters are labelled as A_R, B_R, C_R and D_R. The number of genes, number of conditions, HScore, p-value of the biclusters are (A_R) (22, 12, 198.81, 6.9185e-025) (B_R) (14, 11, 196.08, 1.3464e-017) (C_R) (14, 13, 197.63, 1.3464e-017) (D_R) (17, 13, 199.02, 7.7195e-019).
Figure 3Correspondence plot for the Yeast dataset.
Figure 4Sample biclusters obtained from Reactive GRASP. The biclusters are labelled as B1, B2, B3 and B4. The number of genes, number of conditions, HScore are (B1) (16, 16, 199.79) (B2) (61, 14, 143.03) (B3) (12, 14, 198.50 (B4) (14, 12, 125.83).
Top GO terms (process, function, component) of the biclusters in Figure 4.
| Bicluster | Process | Function | Component |
| B1 | DNA-dependent DNA replication (9, 1.40e-11), DNA replication (9, 3.48e-10), DNA metabolic process(11, 5.45e-09), DNA repair (8, 4.20e-07) | double-stranded DNA binding (4, 4.09e-06), structure-specific DNA binding (4, 8.45e-05), sequence-specific DNA binding(4, 4.5e-04) | replication fork (7,4.10e-11), chromosomal part(8, 2.77e-07) Chromosome(8, 7.84e-07) |
| B2 | Translation (32,2.87e-15), Cellular protein metabolic process (36, 3.59e-9), Protein metabolic process(36, 4.81e-9), Cellular macromolecule biosynthetic process(36,2.51e-08) | Structural constituent of ribosome (28, 3.59e-25) Structural molecule activity (28, 4.71e-20) Translation elongation factor activity (3, .00213) | Cytosolic ribosome (29, 8.7e-30), Cytosolic part (29, 1.2e-26) Ribosome(29, 1.09e-25), Ribosomal subunit(32, 1.13e-24), Ribonucleoprotein complex(34, 7.48e-18) Cytosol (33, 2.05e-17) |
| B3 | Cell cycle process (6, 0.00055), Cell cycle(6, 0.00109), Mitotic cell cycle(5, 0.00145), Cell cycle phase(5, 0.00505) | Kinase regular activity (2, 0.00410) | Cellular bud (6, 1.73e-06), Site of polarized growth(6, 1.87e-06) Cellular bud neck(5, 2.22e-05) Incipient cellular bud site(3,0.00129) |
| B4 | Ribonucleoprotein complex biogenesis and assembly (10, 2.08e-07) Ribosome biogenesis(9, 1.07e-06) | Methyl transferase activity (3,8.56e-03) Transferase activity, transferring one-carbon groups(3,9.17e-03) | Organelle lumen(9,1.2e-04) Intracellular organelle lumen(9, 1.2e-04) Nuclear lumen(8, 2.2e-04) Nucleolus(6, 5.9e-04) |