Literature DB >> 29074582

A generative vision model that trains with high data efficiency and breaks text-based CAPTCHAs.

Dileep George1, Wolfgang Lehrach2, Ken Kansky2, Miguel Lázaro-Gredilla1, Christopher Laan2, Bhaskara Marthi2, Xinghua Lou2, Zhaoshi Meng2, Yi Liu2, Huayan Wang2, Alex Lavin2, D Scott Phoenix2.   

Abstract

Learning from a few examples and generalizing to markedly different situations are capabilities of human visual intelligence that are yet to be matched by leading machine learning models. By drawing inspiration from systems neuroscience, we introduce a probabilistic generative model for vision in which message-passing-based inference handles recognition, segmentation, and reasoning in a unified way. The model demonstrates excellent generalization and occlusion-reasoning capabilities and outperforms deep neural networks on a challenging scene text recognition benchmark while being 300-fold more data efficient. In addition, the model fundamentally breaks the defense of modern text-based CAPTCHAs (Completely Automated Public Turing test to tell Computers and Humans Apart) by generatively segmenting characters without CAPTCHA-specific heuristics. Our model emphasizes aspects such as data efficiency and compositionality that may be important in the path toward general artificial intelligence.
Copyright © 2017 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.

Entities:  

Mesh:

Year:  2017        PMID: 29074582     DOI: 10.1126/science.aag2612

Source DB:  PubMed          Journal:  Science        ISSN: 0036-8075            Impact factor:   47.728


  13 in total

1.  Sensitivity to geometric shape regularity in humans and baboons: A putative signature of human singularity.

Authors:  Mathias Sablé-Meyer; Joël Fagot; Serge Caparos; Timo van Kerkoerle; Marie Amalric; Stanislas Dehaene
Journal:  Proc Natl Acad Sci U S A       Date:  2021-04-20       Impact factor: 11.205

Review 2.  Crossing the Cleft: Communication Challenges Between Neuroscience and Artificial Intelligence.

Authors:  Frances S Chance; James B Aimone; Srideep S Musuvathy; Michael R Smith; Craig M Vineyard; Felix Wang
Journal:  Front Comput Neurosci       Date:  2020-05-06       Impact factor: 2.380

3.  ToyArchitecture: Unsupervised learning of interpretable models of the environment.

Authors:  Jaroslav Vítků; Petr Dluhoš; Joseph Davidson; Matěj Nikl; Simon Andersson; Přemysl Paška; Jan Šinkora; Petr Hlubuček; Martin Stránský; Martin Hyben; Martin Poliak; Jan Feyereisl; Marek Rosa
Journal:  PLoS One       Date:  2020-05-18       Impact factor: 3.240

4.  High-throughput brain activity mapping and machine learning as a foundation for systems neuropharmacology.

Authors:  Xudong Lin; Xin Duan; Claire Jacobs; Jeremy Ullmann; Chung-Yuen Chan; Siya Chen; Shuk-Han Cheng; Wen-Ning Zhao; Annapurna Poduri; Xin Wang; Stephen J Haggarty; Peng Shi
Journal:  Nat Commun       Date:  2018-12-03       Impact factor: 14.919

5.  Accurate, fast, data efficient and interpretable glaucoma diagnosis with automated spatial analysis of the whole cup to disc profile.

Authors:  Ian J C MacCormick; Bryan M Williams; Yalin Zheng; Kun Li; Baidaa Al-Bander; Silvester Czanner; Rob Cheeseman; Colin E Willoughby; Emery N Brown; George L Spaeth; Gabriela Czanner
Journal:  PLoS One       Date:  2019-01-10       Impact factor: 3.240

6.  Recurrence is required to capture the representational dynamics of the human visual system.

Authors:  Tim C Kietzmann; Courtney J Spoerer; Lynn K A Sörensen; Radoslaw M Cichy; Olaf Hauk; Nikolaus Kriegeskorte
Journal:  Proc Natl Acad Sci U S A       Date:  2019-10-07       Impact factor: 11.205

7.  Clone-structured graph representations enable flexible learning and vicarious evaluation of cognitive maps.

Authors:  Dileep George; Rajeev V Rikhye; Nishad Gothoskar; J Swaroop Guntupalli; Antoine Dedieu; Miguel Lázaro-Gredilla
Journal:  Nat Commun       Date:  2021-04-22       Impact factor: 14.919

8.  Sharpening of Hierarchical Visual Feature Representations of Blurred Images.

Authors:  Mohamed Abdelhack; Yukiyasu Kamitani
Journal:  eNeuro       Date:  2018-05-08

9.  Awareness as inference in a higher-order state space.

Authors:  Stephen M Fleming
Journal:  Neurosci Conscious       Date:  2020-03-11

10.  From CAPTCHA to Commonsense: How Brain Can Teach Us About Artificial Intelligence.

Authors:  Dileep George; Miguel Lázaro-Gredilla; J Swaroop Guntupalli
Journal:  Front Comput Neurosci       Date:  2020-10-22       Impact factor: 2.380

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.