Literature DB >> 15794164

Fast SVM training algorithm with decomposition on very large data sets.

A Krzyzak, C Y Suen.   

Abstract

Training a support vector machine on a data set of huge size with thousands of classes is a challenging problem. This paper proposes an efficient algorithm to solve this problem. The key idea is to introduce a parallel optimization step to quickly remove most of the nonsupport vectors, where block diagonal matrices are used to approximate the original kernel matrix so that the original problem can be split into hundreds of subproblems which can be solved more efficiently. In addition, some effective strategies such as kernel caching and efficient computation of kernel matrix are integrated to speed up the training process. Our analysis of the proposed algorithm shows that its time complexity grows linearly with the number of classes and size of the data set. In the experiments, many appealing properties of the proposed algorithm have been investigated and the results show that the proposed algorithm has a much better scaling capability than Libsvm, SVMlight, and SVMTorch. Moreover, the good generalization performances on several large databases have also been achieved.

Mesh:

Year:  2005        PMID: 15794164     DOI: 10.1109/TPAMI.2005.77

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  5 in total

1.  Screening and validation for plasma biomarkers of nephrotoxicity based on metabolomics in male rats.

Authors:  Yubo Li; Haoyue Deng; Liang Ju; Xiuxiu Zhang; Zhenzhu Zhang; Zhen Yang; Lei Wang; Zhiguo Hou; Yanjun Zhang
Journal:  Toxicol Res (Camb)       Date:  2015-11-05       Impact factor: 3.524

2.  AdaBoost based multi-instance transfer learning for predicting proteome-wide interactions between Salmonella and human proteins.

Authors:  Suyu Mei; Hao Zhu
Journal:  PLoS One       Date:  2014-10-17       Impact factor: 3.240

3.  Multi-label multi-instance transfer learning for simultaneous reconstruction and cross-talk modeling of multiple human signaling pathways.

Authors:  Suyu Mei; Hao Zhu
Journal:  BMC Bioinformatics       Date:  2015-12-30       Impact factor: 3.169

4.  Probability weighted ensemble transfer learning for predicting interactions between HIV-1 and human proteins.

Authors:  Suyu Mei
Journal:  PLoS One       Date:  2013-11-18       Impact factor: 3.240

5.  The Caenorhabditis elegans Lifespan Machine.

Authors:  Nicholas Stroustrup; Bryne E Ulmschneider; Zachary M Nash; Isaac F López-Moyado; Javier Apfeld; Walter Fontana
Journal:  Nat Methods       Date:  2013-05-12       Impact factor: 28.547

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.