| Literature DB >> 24516364 |
Li-Ping Tian1, Li-Zhi Liu2, Fang-Xiang Wu3.
Abstract
Microarray technology has produced a huge body of time-course gene expression data and will continue to produce more. Such gene expression data has been proved useful in genomic disease diagnosis and drug design. The challenge is how to uncover useful information from such data by proper analysis methods such as significance analysis and clustering analysis. Many statistic-based significance analysis methods and distance/correlation-based clustering analysis methods have been applied to time-course expression data. However, these techniques are unable to account for the dynamics of such data. It is the dynamics that characterizes such data and that should be considered in analysis of such data. In this paper, we employ a nonlinear model to analyse time-course gene expression data. We firstly develop an efficient method for estimating the parameters in the nonlinear model. Then we utilize this model to perform the significance analysis of individually differentially expressed genes and clustering analysis of a set of gene expression profiles. The verification with two synthetic datasets shows that our developed significance analysis method and cluster analysis method outperform some existing methods. The application to one real-life biological dataset illustrates that the analysis results of our developed methods are in agreement with the existing results.Entities:
Mesh:
Year: 2014 PMID: 24516364 PMCID: PMC3910117 DOI: 10.1155/2014/313747
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Figure 1Plot of 4000 expression profiles for evaluating significance analysis method.
Figure 2Plot of sensitivity versus specificity.
Figure 3Plot of expression profiles for evaluating cluster analysis method.
Figure 4Plot of AARI with different numbers of clusters.
The values of AARI for different clustering methods on synthetic data.
| No. of clusters | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
|---|---|---|---|---|---|---|---|---|---|
| Random initial | 0.2915 | 0.5741 | 0.6636 | 0.7549 | 0.9787 | 0.9516 | 0.8862 | 0.826 | 0.7944 |
|
| 0.2915 | 0.4875 | 0.6741 | 0.7168 | 0.7732 | 0.7668 | 0.7666 | 0.7739 | 0.753 |
|
| 0.2915 | 0.5099 | 0.6352 | 0.7047 | 0.8001 | 0.7635 | 0.8189 | 0.7849 | 0.7827 |
Figure 5Plot of gene expression profiles. (a)–(e) show gene expression profiles for one of five clusters. (f) shows gene expression profiles which are determined as noises.
The model parameters for each cluster.
| Parameters | Cluster 1 (315) | Cluster 2 (233) | Cluster 3 (17) | Cluster 4 (53) | Cluster 5 (228) |
|---|---|---|---|---|---|
|
| −1.1543 | −1.7033 | −0.6612 | −0.5111 | −1.8483 |
|
| 9.8108 | 9.8673 | 7.1631 | 7.0517 | 8.736 |
|
| 0.0234 | 0.2675 | 1.0948 | 0.0024 | 0.4427 |
|
| 0.1389 | 0.033 | −1.2261 | 0.1248 | −0.6807 |
|
| −0.1287 | 0.1422 | 0.3353 | 0.5748 | −0.3738 |
|
| 0.1383 | −0.1372 | −0.2723 | −0.6011 | 0.3946 |