| Literature DB >> 26628919 |
Haifa Ben Saber1, Mourad Elloumi2.
Abstract
The biclustering of microarray data has been the subject of a large research. No one of the existing biclustering algorithms is perfect. The construction of biologically significant groups of biclusters for large microarray data is still a problem that requires a continuous work. Biological validation of biclusters of microarray data is one of the most important open issues. So far, there are no general guidelines in the literature on how to validate biologically extracted biclusters. In this paper, we develop two biclustering algorithms of binary microarray data, adopting the Iterative Row and Column Clustering Combination (IRCCC) approach, called BiBinCons and BiBinAlter. However, the BiBinAlter algorithm is an improvement of BiBinCons. On the other hand, BiBinAlter differs from BiBinCons by the use of the EvalStab and IndHomog evaluation functions in addition to the CroBin one (Bioinformatics 20:1993-2003, 2004). BiBinAlter can extracts biclusters of good quality with better p-values.Entities:
Keywords: Algorithm; Biclustering; Evaluation function; Microarray data analysis
Year: 2015 PMID: 26628919 PMCID: PMC4666179 DOI: 10.1186/s13040-015-0070-4
Source DB: PubMed Journal: BioData Min ISSN: 1756-0381 Impact factor: 2.522
Corresponding parameters values of our algorithms
| Algorithms | Corresponding parameters values |
|---|---|
|
| |
|
|
Values of Shared and NotShared for non overlapping biclusters
| Algorithms |
|
|
|---|---|---|
| CC | 18.21 % | 36.57 % |
| OPSM | 46.39 % | 74.42 % |
| ISA | 39.38 % | 5.31 % |
| BiMax | 58.18 % | 21.39 % |
|
| 88 % | 12 % |
|
| 100 % | 37.03 % |
Values of Shared and NotShared for overlapping biclusters
| Algorithms |
|
|
|---|---|---|
| CC | 13.21 % | 36.57 % |
| OPSM | 82.02 % | 50.51 % |
| ISA | 29.28 % | 7.31 % |
| BiMax | 48.18 % | 22.39 % |
|
| 87.30 % | 61 % |
|
| 89.40 % | 57.32 % |
Number of biclusters obtained by our algorithms on real datasets
| Algorithms |
|
|
|---|---|---|
|
| 883 | 1921 |
|
| 708 | 1720 |
|
| 529 | 1900 |
|
| 881 | 1769 |
|
| 708 | 1700 |
Values of Coverage for Yeast cell cycle and Human B-cell Lymphoma datasets
| Datasets | Algorithms | Total coverage | Genes coverage | Conditions coverage |
|---|---|---|---|---|
| Yeast celll cycle | CC | 81.47 % | 97.12 % | 100 % |
|
| 39.14 % | 44.5 % | 100 % | |
|
| 47 % | 48,03 % | 100 % | |
| Human B-cell Lymphoma | CC | 36.81 % | 91.58 % | 100 % |
|
| 34.14 % | 37.51 % | 100 % | |
|
| 41 % | 46.13 % | 100 % |
The most important terms of GO for the two most significant extracted biclusters from Yeast cell cycle dataset by BiBinCons and BiBinAlter
| Biclusters | Biological process | Molecular function | Cell component |
|---|---|---|---|
| 12 genes, 13 conditions | Cellular response to chromatin binding microtubule organizing 13 conditions DNA damage stimulus (25 %,0.00037) center part (66.7 %, 1:87 * 10-8) (16.7 %, 0.00742) response to DNA damage stimulus (66.7 %, 6:30 * 10-8) cellular response to stress (66.7 %, 2:12 * 10-7) cellular response to stimulus (66,7 %, 3:25 * 10-7) DNA repair (50 %, 2:58 * 10-5) response to stress (66.7 %, 2:98 * 10-5) | Chromatin binding microtubule organizing 13 conditions DNA damage stimulus (25 %,0.00037) | Microtubule organizing 13 conditions DNA damage stimulus (25 %,0.00037) center part (66.7 %, 1:87 * 10-8) (16.7 %, 0.00742) |
| 11 genes, 11 conditions | Cell cycle process GTPase activator microtubule cytoskeleton 11 conditions (63.6 %, 2:93 * 10-5) activity (18.2 %,0.00994) (45.5 %, 6:33 * 10-6) cell cycle microtubule organizing (63.6 %, 6:85 * 10-5) | GTPase activator microtubule cytoskeleton 11 conditions (63.6 %, 2:93 * 10-5) activity (18.2 %,0.00994) | Microtubule cytoskeleton 11 conditions (63.6 %, 2:93 * 10-5) activity (18.2 %,0.00994) (45.5 %, 6:33 * 10-6) cell cycle microtubule organizing (63.6 %, 6:85 * 10-5) center (36.4 %,4:97 * 10-5) spindle pole body (36.4 %, 4:97 * 10-5) spindle pole (36.4 %, 6:77 * 10-5) |
Computing time of our algorithms
| Datasets |
|
|
|---|---|---|
|
| 32 min | 37 min 12 sec |
|
| 8 min | 8 min 3 sec |