Literature DB >> 24678270

On Efficient Large Margin Semisupervised Learning: Method and Theory.

Junhui Wang1, Xiaotong Shen2, Wei Pan3.   

Abstract

In classification, semisupervised learning usually involves a large amount of unlabeled data with only a small number of labeled data. This imposes a great challenge in that it is difficult to achieve good classification performance through labeled data alone. To leverage unlabeled data for enhancing classification, this article introduces a large margin semisupervised learning method within the framework of regularization, based on an efficient margin loss for unlabeled data, which seeks efficient extraction of the information from unlabeled data for estimating the Bayes decision boundary for classification. For implementation, an iterative scheme is derived through conditional expectations. Finally, theoretical and numerical analyses are conducted, in addition to an application to gene function prediction. They suggest that the proposed method enables to recover the performance of its supervised counterpart based on complete data in rates of convergence, when possible.

Entities:  

Keywords:  classification; difference convex programming; nonconvex minimization; regularization; support vectors

Year:  2009        PMID: 24678270      PMCID: PMC3964604     

Source DB:  PubMed          Journal:  J Mach Learn Res        ISSN: 1532-4435            Impact factor:   3.654


  7 in total

1.  Using the Fisher kernel method to detect remote protein homologies.

Authors:  T Jaakkola; M Diekhans; D Haussler
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1999

2.  New support vector algorithms

Authors: 
Journal:  Neural Comput       Date:  2000-05       Impact factor: 2.026

3.  Functional discovery via a compendium of expression profiles.

Authors:  T R Hughes; M J Marton; A R Jones; C J Roberts; R Stoughton; C D Armour; H A Bennett; E Coffey; H Dai; Y D He; M J Kidd; A M King; M R Meyer; D Slade; P Y Lum; S B Stepaniants; D D Shoemaker; D Gachotte; K Chakraburtty; J Simon; M Bard; S H Friend
Journal:  Cell       Date:  2000-07-07       Impact factor: 41.582

4.  A system of scoring linkage data, with special reference to the pied factors in mice.

Authors:  R A FISHER
Journal:  Am Nat       Date:  1946 Sep-Oct       Impact factor: 3.926

5.  Gene function prediction by a combined analysis of gene expression data and protein-protein interaction data.

Authors:  Guanghua Xiao; Wei Pan
Journal:  J Bioinform Comput Biol       Date:  2005-12       Impact factor: 1.122

6.  MIPS: a database for protein sequences, homology data and yeast genome information.

Authors:  H W Mewes; K Albermann; K Heumann; S Liebl; F Pfeiffer
Journal:  Nucleic Acids Res       Date:  1997-01-01       Impact factor: 16.971

7.  Transitive functional annotation by shortest-path analysis of gene expression data.

Authors:  Xianghong Zhou; Ming-Chih J Kao; Wing Hung Wong
Journal:  Proc Natl Acad Sci U S A       Date:  2002-08-26       Impact factor: 11.205

  7 in total
  4 in total

1.  Targeted Local Support Vector Machine for Age-Dependent Classification.

Authors:  Tianle Chen; Yuanjia Wang; Huaihou Chen; Karen Marder; Donglin Zeng
Journal:  J Am Stat Assoc       Date:  2014-09-01       Impact factor: 5.033

2.  Auxiliary marker-assisted classification in the absence of class identifiers.

Authors:  Yuanjia Wang; Huaihou Chen; Donglin Zeng; Christine Mauro; Naihua Duan; M Katherine Shear
Journal:  J Am Stat Assoc       Date:  2013-06-01       Impact factor: 5.033

3.  Breast cancer survivability prediction using labeled, unlabeled, and pseudo-labeled patient data.

Authors:  Juhyeon Kim; Hyunjung Shin
Journal:  J Am Med Inform Assoc       Date:  2013-03-06       Impact factor: 4.497

4.  A coupling approach of a predictor and a descriptor for breast cancer prognosis.

Authors:  Hyunjung Shin; Yonghyun Nam
Journal:  BMC Med Genomics       Date:  2014-05-08       Impact factor: 3.063

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.