| Literature DB >> 16046812 |
Halima Bensmail1, Buddana Aruna, O John Semmes, Abdelali Haoudi.
Abstract
Clustering proteomics data is a challenging problem for any traditional clustering algorithm. Usually, the number of samples is largely smaller than the number of protein peaks. The use of a clustering algorithm which does not take into consideration the number of features of variables (here the number of peaks) is needed. An innovative hierarchical clustering algorithm may be a good approach. We propose here a new dissimilarity measure for the hierarchical clustering combined with a functional data analysis. We present a specific application of functional data analysis (FDA) to a high-throughput proteomics study. The high performance of the proposed algorithm is compared to two popular dissimilarity measures in the clustering of normal and human T-cell leukemia virus type 1 (HTLV-1)-infected patients samples.Entities:
Year: 2005 PMID: 16046812 PMCID: PMC1184055 DOI: 10.1155/JBB.2005.80
Source DB: PubMed Journal: J Biomed Biotechnol ISSN: 1110-7243
Figure 1Three cut expressions from a normal, an HAM, and an ATL patient.
Figure 2Original curve and a smoothed curve.
Figure 3Clustering proteomics data with Diana.
Figure 4Pattern recognition using dissimilarity matrix δ.
Figure 5Pattern recognition using δ.
Figure 6Pattern recognition using δ.
Figure 7Dendogram of the δ dissimilarity approach with Diana.
Confusion matrix to show the performance of δ using Diana.
| Predicted | |||||
|---|---|---|---|---|---|
| Classification | HAM | ATL | NOR | Total | |
| Clinical | HAM | 8 | 3 | 0 | 11 |
| ATL | 5 | 14 | 1 | 20 | |
| NOR | 1 | 2 | 34 | 37 | |
| Classification rate | 0.73 | 0.70 | 0.92 | 0.84 | |
Confusion matrix to show the performance of δ using Clara.
| Predicted | |||||
|---|---|---|---|---|---|
| Classification | HAM | ATL | NOR | Total | |
| Clinical | HAM | 10 | 1 | 0 | 11 |
| ATL | 2 | 18 | 0 | 20 | |
| NOR | 1 | 1 | 35 | 37 | |
| Classification rate | 0.91 | 0.90 | 0.95 | 0.93 | |
Figure 8The δ dissimilarity approach with Clara.