| Literature DB >> 25763839 |
Beatriz Pontes1, Ral Girldez2, Jess S Aguilar-Ruiz2.
Abstract
An noticeable number of biclustering approaches have been proposed proposed for the study of gene expression data, especially for discovering functionally related gene sets under different subsets of experimental conditions. In this context, recognizing groups of co-expressed or co-regulated genes, that is, genes which follow a similar expression pattern, is one of the main objectives. Due to the problem complexity, heuristic searches are usually used instead of exhaustive algorithms. Furthermore, most of biclustering approaches use a measure or cost function that determines the quality of biclusters. Having a suitable quality metric for bicluster is a critical aspect, not only for guiding the search, but also for establishing a comparison criteria among the results obtained by different biclustering techniques. In this paper, we analyse a large number of existing approaches to quality measures for gene expression biclusters, as well as we present a comparative study of them based on their capability to recognize different expression patterns in biclusters.Entities:
Mesh:
Year: 2015 PMID: 25763839 PMCID: PMC4357449 DOI: 10.1371/journal.pone.0115497
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Evaluation measures evolution from a shifting base bicluster with induced errors.
Figure 2Evaluation measures evolution from a scaling base bicluster with induced errors.
Figure 3Evaluation measures evolution from a combined base bicluster with induced errors.
Real Datasets used for Performance Comparison.
|
|
|
|
|
|
|---|---|---|---|---|
| Yeast | Yeast | 2884 | 17 | [ |
| Embryonal | Embryonal tumors of the central nervous syst. | 7129 | 60 | [ |
| Leukemia | Leukemia | 7129 | 72 | [ |
| Steminal | Steminal Cells | 26127 | 30 | [ |
Figure 4Mutual information dependencies among metrics for the first test case.
Figure 5Mutual information dependencies among metrics for the second test case.
Mutual Information based Ranking for the first test case.
|
|
| ||
|---|---|---|---|
|
|
|
| |
| bestPValue | MSA (0.924) | ASR (0.919) | PCC |
| meanPValue | ACV (0.786) | SBM (0.773) | VE |
| numSigTerms | ACV (0.748) | AC (0.747) | SBM (0.745) |
|
|
|
|
|
| VAR | MSA (0.985) | PCC | MSR (0.939) |
| MSR | PCC | MSA (0.946) | VAR (0.939) |
| SMSR | MSA (0.926) | PCC | VAR (0.894) |
| PCC | VE | ACV (0.923) | SBM (0.920) |
| PCCcols | MSA (0.977) | VAR (0.953) | MSR (0.946) |
| AC | ACV (0.899) | VE | SCS (0.877) |
| SCS | VEt (0.930) | ACV (0.892) | PCC |
| ACV | SBM (0.953) | VEt (0.934) | MSA (0.933) |
| ASR | MSA (0.969) | SBM (0.957) | PCC |
| SBM | ASR (0.957) | ACV (0.953) | MSA (0.932) |
| MSA | VAR (0.985) | PCC | ASR (0.969) |
| VE | ACV (0.906) | SBM (0.897) | VE |
| VE | ACV (0.934) | SCS (0.930) | PCC |
Mutual Information based Ranking for the second test case.
|
|
| ||
|---|---|---|---|
|
|
|
| |
| bestPValue | MSA (0.932) | ACV (0.924) | VE |
| meanPValue | ACV (0.881) | SBM (0.861) | VE |
| numSigTerms | ACV (0.889) | SBM (0.874) | AC (0.873) |
|
|
|
|
|
| VAR | MSA (0.966) | ASR (0.936) | MSR (0.932) |
| MSR | MSA (0.941) | VAR (0.932) | PCC |
| SMSR | MSA (0.891) | VAR (0.881) | MSR (0.880) |
| PCC | VE | ACV (0.941) | SBM (0.938) |
| PCC | MSA (0.956) | ASR (0.942) | SBM (0.918) |
| AC | SCS (0.952) | VE | ACV (0.940) |
| SCS | VE | AC (0.952) | ACV (0.929) |
| ACV | SBM (0.972) | VE | PCC |
| ASR | MSA (0.963) | SBM (0.945) | PCC |
| SBM | ACV (0.972) | VE | ASR (0.945) |
| MSA | VAR (0.966) | ASR (0.963) | PCC |
| VE | ACV (0.914) | SBM (0.904) | AC (0.900) |
| VE | SCS (0.959) | ACV (0.950) | SBM (0.949) |
Summary of Bicluster Evaluation Measures.
|
|
|
| ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
| ||||||||
|
|
|
|
|
|
|
|
|
| ||
| VAR | [ | Exp | ↓ | Exp | ↑ | NT | ↑↑ | |||
| MSR | [ | Exp | • | NS | Exp | ↑ | Exp | ↑ | ||
| SMSR | [ | Exp | ↓↓ | Exp | • | ↓↓ | NT | ↑↑ | ||
| RI | [ | - | - | - | - | - | - | - | - | - |
| PCC | [ | Log | • | NS | Log | ↑ | Log | ↑ | ||
| AC | [ | Exp | NS | Exp | low | Exp | ↑ | |||
| SCS | [ | Exp | • | NS | Exp | NS | Exp | • | NS | |
| ACV | [ | Log | • | NS | Log | • | NS | Log | ↑ | |
| ASR | [ | Cons-Lin-Log | • | ↓↓ | Cons-Lin | • | ↓↓ | Cons-Lin-Log | • | ↓↓ |
| SBM | [ | Lin | NS | Cons-Lin | ↓↓ | Cons-Lin | ↑↑ | |||
| MSA | [ | Exp | ↓↓ | NT | ↑↑ | NT | ↑↑ | |||
| VE | [ | Lin | • | NS | Lin | • | NS | Exp | ↑ | |
| VE | [ | Lin | • | NS | Lin | • | NS | Lin | • | NS |
| SS | [ | - | - | - | - | - | - | - | - | - |