Literature DB >> 19443179

Another look at statistical learning theory and regularization.

Vladimir Cherkassky1, Yunqian Ma.   

Abstract

The paper reviews and highlights distinctions between function-approximation (FA) and VC theory and methodology, mainly within the setting of regression problems and a squared-error loss function, and illustrates empirically the differences between the two when data is sparse and/or input distribution is non-uniform. In FA theory, the goal is to estimate an unknown true dependency (or 'target' function) in regression problems, or posterior probability P(y/x) in classification problems. In VC theory, the goal is to 'imitate' unknown target function, in the sense of minimization of prediction risk or good 'generalization'. That is, the result of VC learning depends on (unknown) input distribution, while that of FA does not. This distinction is important because regularization theory originally introduced under clearly stated FA setting [Tikhonov, N. (1963). On solving ill-posed problem and method of regularization. Doklady Akademii Nauk USSR, 153, 501-504; Tikhonov, N., & V. Y. Arsenin (1977). Solution of ill-posed problems. Washington, DC: W. H. Winston], has been later used under risk-minimization or VC setting. More recently, several authors [Evgeniou, T., Pontil, M., & Poggio, T. (2000). Regularization networks and support vector machines. Advances in Computational Mathematics, 13, 1-50; Hastie, T., Tibshirani, R., & Friedman, J. (2001). The elements of statistical learning: Data mining, inference and prediction. Springer; Poggio, T. and Smale, S., (2003). The mathematics of learning: Dealing with data. Notices of the AMS, 50 (5), 537-544] applied constructive methodology based on regularization framework to learning dependencies from data (under VC-theoretical setting). However, such regularization-based learning is usually presented as a purely constructive methodology (with no clearly stated problem setting). This paper compares FA/regularization and VC/risk minimization methodologies in terms of underlying theoretical assumptions. The control of model complexity, using regularization and using the concept of margin in SVMs, is contrasted in the FA and VC formulations.

Entities:  

Mesh:

Year:  2009        PMID: 19443179     DOI: 10.1016/j.neunet.2009.04.005

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  12 in total

1.  Association of radiomic imaging features and gene expression profile as prognostic factors in pancreatic ductal adenocarcinoma.

Authors:  Ke Li; Jingjing Xiao; Jiali Yang; Meng Li; Xuanqi Xiong; Yongjian Nian; Linbo Qiao; Huaizhi Wang; Aydin Eresen; Zhuoli Zhang; Xianling Hu; Jian Wang; Wei Chen
Journal:  Am J Transl Res       Date:  2019-07-15       Impact factor: 4.060

2.  Four genetic variants interact to confer susceptibility to atopic dermatitis in Chinese Han population.

Authors:  Changbing Shen; Longdan Liu; Zengqiong Jiang; Xiaodong Zheng; Li Meng; Xianyong Yin; Jing Gao; Yujun Sheng; Jingping Gao; Yang Li; Fusheng Zhou; Fengli Xiao; Liangdan Sun; Yong Cui; Sen Yang; Xianbo Zuo; Xuejun Zhang
Journal:  Mol Genet Genomics       Date:  2015-02-25       Impact factor: 3.291

3.  Variants in TNFSF4, TNFAIP3, TNIP1, BLK, SLC15A4 and UBE2L3 interact to confer risk of systemic lupus erythematosus in Chinese population.

Authors:  Xian-Bo Zuo; Yu-Jun Sheng; Su-Juan Hu; Jin-Ping Gao; Yang Li; Hua-Yang Tang; Xian-Fa Tang; Hui Cheng; Xian-Yong Yin; Lei-Lei Wen; Liang-Dan Sun; Sen Yang; Yong Cui; Xue-Jun Zhang
Journal:  Rheumatol Int       Date:  2013-10-04       Impact factor: 2.631

4.  Eigenanatomy improves detection power for longitudinal cortical change.

Authors:  Brian Avants; Paramveer Dhillon; Benjamin M Kandel; Philip A Cook; Corey T McMillan; Murray Grossman; James C Gee
Journal:  Med Image Comput Comput Assist Interv       Date:  2012

5.  Using species distribution model to estimate the wintering population size of the endangered scaly-sided merganser in China.

Authors:  Qing Zeng; Yamian Zhang; Gongqi Sun; Hairui Duo; Li Wen; Guangchun Lei
Journal:  PLoS One       Date:  2015-02-03       Impact factor: 3.240

6.  Diagnostics for Statistical Variable Selection Methods for Prediction of Peptic Ulcer Disease in Helicobacter pylori Infection.

Authors:  Hyunsu Ju; Allan R Brasier; Alexander Kurosky; Bo Xu; Victor E Reyes; David Y Graham
Journal:  J Proteomics Bioinform       Date:  2014-04-01

7.  Disaggregating census data for population mapping using random forests with remotely-sensed and ancillary data.

Authors:  Forrest R Stevens; Andrea E Gaughan; Catherine Linard; Andrew J Tatem
Journal:  PLoS One       Date:  2015-02-17       Impact factor: 3.240

8.  Mining the Breast Cancer Proteome for Predictors of Drug Sensitivity.

Authors:  Leslie C Timpe; Dian Li; Ten-Yang Yen; Judi Wong; Roger Yen; Bruce A Macher; Alexandra Piryatinska
Journal:  J Proteomics Bioinform       Date:  2015

9.  Discovery of Small-Molecule Activators for Glucose-6-Phosphate Dehydrogenase (G6PD) Using Machine Learning Approaches.

Authors:  Madhu Sudhana Saddala; Anton Lennikov; Hu Huang
Journal:  Int J Mol Sci       Date:  2020-02-23       Impact factor: 5.923

10.  Multiscale modeling of the causal functional roles of nsSNPs in a genome-wide association study: application to hypoxia.

Authors:  Li Xie; Clara Ng; Thahmina Ali; Raoul Valencia; Barbara L Ferreira; Vincent Xue; Maliha Tanweer; Dan Zhou; Gabriel G Haddad; Philip E Bourne; Lei Xie
Journal:  BMC Genomics       Date:  2013-05-28       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.