Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Towards automatic clustering of protein sequences.

Literature DB >> 15838134

Towards automatic clustering of protein sequences.

Abstract

Analyzing protein sequence data becomes increasingly important recently. Most previous work on this area has mainly focused on building classification models. In this paper, we investigate in the problem of automatic clustering of unlabeled protein sequences. As a widely recognized technique in statistics and computer science, clustering has been proven very useful in detecting unknown object categories and revealing hidden correlations among objects. One difficulty that prevents clustering from being performed directly on protein sequence is the lack of an effective similarity measure that can be computed efficiently. Therefore, we propose a novel model for protein sequence cluster by exploring significant statistical properties possessed by the sequences. The concept of imprecise probabilities are introduced to the original probabilistic suffix tree to monitor the convergence of the empirical measurement and to guide the clustering process. It has been demonstrated that the proposed method can successfully discover meaningful families without the necessity of learning models of different families from pre-labeled "training data".

Mesh：

Substances：
Proteins

Year: 2002 PMID： 15838134

Source DB: PubMed Journal: Proc IEEE Comput Soc Bioinform Conf ISSN： 1555-3930

Keyword Cloud
Cited

2 in total

1. Discovering Activities to Recognize and Track in a Smart Environment.

Authors: Parisa Rashidi; Diane J Cook; Lawrence B Holder; Maureen Schmitter-Edgecombe
Journal: IEEE Trans Knowl Data Eng Date: 2011 Impact factor: 6.977

2. Deep-sequencing of the peach latent mosaic viroid reveals new aspects of population heterogeneity.

Authors: Jean-Pierre Sehi Glouzon; François Bolduc; Shengrui Wang; Rafael J Najmanovich; Jean-Pierre Perreault
Journal: PLoS One Date: 2014-01-30 Impact factor: 3.240

2 in total