Literature DB >> 18252607

Support vector machines for spam categorization.

H Drucker1, D Wu, V N Vapnik.   

Abstract

We study the use of support vector machines (SVM's) in classifying e-mail as spam or nonspam by comparing it to three other classification algorithms: Ripper, Rocchio, and boosting decision trees. These four algorithms were tested on two different data sets: one data set where the number of features were constrained to the 1000 best features and another data set where the dimensionality was over 7000. SVM's performed best when using binary features. For both data sets, boosting trees and SVM's had acceptable test performance in terms of accuracy and speed. However, SVM's had significantly less training time.

Entities:  

Year:  1999        PMID: 18252607     DOI: 10.1109/72.788645

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw        ISSN: 1045-9227


  25 in total

1.  Prediction of RNA-binding proteins from primary sequence by a support vector machine approach.

Authors:  Lian Yi Han; Cong Zhong Cai; Siew Lin Lo; Maxey C M Chung; Yu Zong Chen
Journal:  RNA       Date:  2004-03       Impact factor: 4.942

2.  Text mining for the Vaccine Adverse Event Reporting System: medical text classification using informative feature selection.

Authors:  Taxiarchis Botsis; Michael D Nguyen; Emily Jane Woo; Marianthi Markatou; Robert Ball
Journal:  J Am Med Inform Assoc       Date:  2011-06-27       Impact factor: 4.497

3.  Propensity score estimation: neural networks, support vector machines, decision trees (CART), and meta-classifiers as alternatives to logistic regression.

Authors:  Daniel Westreich; Justin Lessler; Michele Jonsson Funk
Journal:  J Clin Epidemiol       Date:  2010-08       Impact factor: 6.437

4.  Natural language processing: state of the art, current trends and challenges.

Authors:  Diksha Khurana; Aditya Koli; Kiran Khatter; Sukhdev Singh
Journal:  Multimed Tools Appl       Date:  2022-07-14       Impact factor: 2.577

5.  A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network.

Authors:  Zhu-Hong You; Zheng Yin; Kyungsook Han; De-Shuang Huang; Xiaobo Zhou
Journal:  BMC Bioinformatics       Date:  2010-06-24       Impact factor: 3.169

6.  Splice site identification using probabilistic parameters and SVM classification.

Authors:  A K M A Baten; B C H Chang; S K Halgamuge; Jason Li
Journal:  BMC Bioinformatics       Date:  2006-12-18       Impact factor: 3.169

7.  Simple-random-sampling-based multiclass text classification algorithm.

Authors:  Wuying Liu; Lin Wang; Mianzhu Yi
Journal:  ScientificWorldJournal       Date:  2014-03-19

8.  Intelligent screening systems for cervical cancer.

Authors:  Yessi Jusman; Siew Cheok Ng; Noor Azuan Abu Osman
Journal:  ScientificWorldJournal       Date:  2014-05-11

9.  Fast splice site detection using information content and feature reduction.

Authors:  A K M A Baten; S K Halgamuge; B C H Chang
Journal:  BMC Bioinformatics       Date:  2008-12-12       Impact factor: 3.169

10.  A novel approach for prediction of vitamin d status using support vector regression.

Authors:  Shuyu Guo; Robyn M Lucas; Anne-Louise Ponsonby
Journal:  PLoS One       Date:  2013-11-26       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.