Literature DB >> 17282075

A Protein Classification Method Based on Latent Semantic Analysis<sup>*</sup>.

Yongsheng Yuan, Lei Lin, Qiwen Dong, Xiaolong Wang, Minghui Li.   

Abstract

In this paper a new method that uses Latent Semantic Analysis (LSA) to denote a protein sequence is proposed for researching the protein classification problem. A protein is vectorized according to its content of biological words: patterns and motifs, which are generated by utilizing TEIRESIAS algorithm and MEME/MAST system respectively. More precise description vectors of proteins are obtained through employing LSA. Those vectors are used to classify proteins combined with the Support Vector Machine (SVM). Experiments of family-level protein classification on Structural Classification of Proteins database show that the performance of this method is better than that of the other state-of-the-arts methods.

Year:  2005        PMID: 17282075     DOI: 10.1109/IEMBS.2005.1616306

Source DB:  PubMed          Journal:  Conf Proc IEEE Eng Med Biol Soc        ISSN: 1557-170X


  1 in total

1.  Physicochemical property distributions for accurate and rapid pairwise protein homology detection.

Authors:  Bobbie-Jo M Webb-Robertson; Kyle G Ratuiste; Christopher S Oehmen
Journal:  BMC Bioinformatics       Date:  2010-03-19       Impact factor: 3.169

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.