Literature DB >> 29994750

On the Generalization Ability of Online Gradient Descent Algorithm Under the Quadratic Growth Condition.

Daqing Chang, Ming Lin, Changshui Zhang.   

Abstract

Online learning has been successfully applied in various machine learning problems. Conventional analysis of online learning achieves a sharp generalization bound with a strongly convex assumption. In this paper, we study the generalization ability of the classic online gradient descent algorithm under the quadratic growth condition (QGC), a strictly weaker condition than strong convexity. Under some mild assumptions, we prove that the excess risk converges no worse than $O(\log T/T)$ when the data are independently and identically distributed (i.i.d.). When the data are generated from a $\phi $ -mixing process, we achieve the excess risk bound $O(\log T /T+\phi (\tau))$ , where $\phi (\tau)$ is the mixing coefficient capturing the non-i.i.d. attribute. Our key technique is based on the combination of the QGC and the martingale concentrations. Our results indicate that the strong convexity is not necessary to achieve the sharp $O(\log {T}/T)$ convergence rate in online learning. We verify our theories on both synthetic and real-world data.

Entities:  

Mesh:

Year:  2018        PMID: 29994750      PMCID: PMC6237551          DOI: 10.1109/TNNLS.2017.2764960

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw Learn Syst        ISSN: 2162-237X            Impact factor:   10.451


  5 in total

1.  Learning a Coupled Linearized Method in Online Setting.

Authors:  Wei Xue; Wensheng Zhang
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2016-01-22       Impact factor: 10.451

Review 2.  Deep learning in neural networks: an overview.

Authors:  Jürgen Schmidhuber
Journal:  Neural Netw       Date:  2014-10-13

Review 3.  Deep learning.

Authors:  Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal:  Nature       Date:  2015-05-28       Impact factor: 49.962

4.  Memristor-based multilayer neural networks with online gradient descent training.

Authors:  Daniel Soudry; Dotan Di Castro; Asaf Gal; Avinoam Kolodny; Shahar Kvatinsky
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2015-01-14       Impact factor: 10.451

5.  A Note on the Unification of Adaptive Online Learning.

Authors:  Wenwu He; James Tin-Yau Kwok; Ji Zhu; Yang Liu
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2016-02-24       Impact factor: 10.451

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.