Literature DB >> 27639721

Piece-wise quadratic approximations of arbitrary error functions for fast and robust machine learning.

A N Gorban1, E M Mirkes2, A Zinovyev3.   

Abstract

Most of machine learning approaches have stemmed from the application of minimizing the mean squared distance principle, based on the computationally efficient quadratic optimization methods. However, when faced with high-dimensional and noisy data, the quadratic error functionals demonstrated many weaknesses including high sensitivity to contaminating factors and dimensionality curse. Therefore, a lot of recent applications in machine learning exploited properties of non-quadratic error functionals based on L1 norm or even sub-linear potentials corresponding to quasinorms Lp (0<p<1). The back side of these approaches is increase in computational cost for optimization. Till so far, no approaches have been suggested to deal with arbitrary error functionals, in a flexible and computationally efficient framework. In this paper, we develop a theory and basic universal data approximation algorithms (k-means, principal components, principal manifolds and graphs, regularized and sparse regression), based on piece-wise quadratic error potentials of subquadratic growth (PQSQ potentials). We develop a new and universal framework to minimize arbitrary sub-quadratic error potentials using an algorithm with guaranteed fast convergence to the local or global error minimum. The theory of PQSQ potentials is based on the notion of the cone of minorant functions, and represents a natural approximation formalism based on the application of min-plus algebra. The approach can be applied in most of existing machine learning methods, including methods of data approximation and regularized and sparse regression, leading to the improvement in the computational cost/accuracy trade-off. We demonstrate that on synthetic and real-life datasets PQSQ-based machine learning methods achieve orders of magnitude faster computational performance than the corresponding state-of-the-art methods, having similar or better approximation accuracy.
Copyright © 2016 Elsevier Ltd. All rights reserved.

Keywords:  Clustering; Data approximation; Nonquadratic potential; Principal components; Regularized regression; Sparse regression

Mesh:

Year:  2016        PMID: 27639721     DOI: 10.1016/j.neunet.2016.08.007

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  2 in total

Review 1.  Blessing of dimensionality: mathematical foundations of the statistical physics of data.

Authors:  A N Gorban; I Y Tyukin
Journal:  Philos Trans A Math Phys Eng Sci       Date:  2018-04-28       Impact factor: 4.226

2.  Robust and Scalable Learning of Complex Intrinsic Dataset Geometry via ElPiGraph.

Authors:  Luca Albergante; Evgeny Mirkes; Jonathan Bac; Huidong Chen; Alexis Martin; Louis Faure; Emmanuel Barillot; Luca Pinello; Alexander Gorban; Andrei Zinovyev
Journal:  Entropy (Basel)       Date:  2020-03-04       Impact factor: 2.524

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.