Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 On the Generalization Ability of Online Gradient Descent Algorithm Under the Quadratic Growth Condition.

Literature DB >> 29994750

On the Generalization Ability of Online Gradient Descent Algorithm Under the Quadratic Growth Condition.

Daqing Chang, Ming Lin, Changshui Zhang.

Abstract

Online learning has been successfully applied in various machine learning problems. Conventional analysis of online learning achieves a sharp generalization bound with a strongly convex assumption. In this paper, we study the generalization ability of the classic online gradient descent algorithm under the quadratic growth condition (QGC), a strictly weaker condition than strong convexity. Under some mild assumptions, we prove that the excess risk converges no worse than $O(\log T/T)$ when the data are independently and identically distributed (i.i.d.). When the data are generated from a $\phi $ -mixing process, we achieve the excess risk bound $O(\log T /T+\phi (\tau))$ , where $\phi (\tau)$ is the mixing coefficient capturing the non-i.i.d. attribute. Our key technique is based on the combination of the QGC and the martingale concentrations. Our results indicate that the strong convexity is not necessary to achieve the sharp $O(\log {T}/T)$ convergence rate in online learning. We verify our theories on both synthetic and real-world data.

Entities: Chemical Disease Gene

Mesh：

Year: 2018 PMID： 29994750 PMCID： PMC6237551 DOI： 10.1109/TNNLS.2017.2764960

Source DB: PubMed Journal: IEEE Trans Neural Netw Learn Syst ISSN： 2162-237X Impact factor: 10.451

Keyword Cloud
References

5 in total

1. Learning a Coupled Linearized Method in Online Setting.

Authors: Wei Xue; Wensheng Zhang
Journal: IEEE Trans Neural Netw Learn Syst Date: 2016-01-22 Impact factor: 10.451

Review 2. Deep learning in neural networks: an overview.

Authors: Jürgen Schmidhuber
Journal: Neural Netw Date: 2014-10-13

Review 3. Deep learning.

Authors: Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal: Nature Date: 2015-05-28 Impact factor: 49.962

4. Memristor-based multilayer neural networks with online gradient descent training.

Authors: Daniel Soudry; Dotan Di Castro; Asaf Gal; Avinoam Kolodny; Shahar Kvatinsky
Journal: IEEE Trans Neural Netw Learn Syst Date: 2015-01-14 Impact factor: 10.451

5. A Note on the Unification of Adaptive Online Learning.

Authors: Wenwu He; James Tin-Yau Kwok; Ji Zhu; Yang Liu
Journal: IEEE Trans Neural Netw Learn Syst Date: 2016-02-24 Impact factor: 10.451

5 in total