Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A generative vision model that trains with high data efficiency and breaks text-based CAPTCHAs.

Literature DB >> 29074582

A generative vision model that trains with high data efficiency and breaks text-based CAPTCHAs.

Dileep George¹, Wolfgang Lehrach², Ken Kansky², Miguel Lázaro-Gredilla¹, Christopher Laan², Bhaskara Marthi², Xinghua Lou², Zhaoshi Meng², Yi Liu², Huayan Wang², Alex Lavin², D Scott Phoenix².

Abstract

Learning from a few examples and generalizing to markedly different situations are capabilities of human visual intelligence that are yet to be matched by leading machine learning models. By drawing inspiration from systems neuroscience, we introduce a probabilistic generative model for vision in which message-passing-based inference handles recognition, segmentation, and reasoning in a unified way. The model demonstrates excellent generalization and occlusion-reasoning capabilities and outperforms deep neural networks on a challenging scene text recognition benchmark while being 300-fold more data efficient. In addition, the model fundamentally breaks the defense of modern text-based CAPTCHAs (Completely Automated Public Turing test to tell Computers and Humans Apart) by generatively segmenting characters without CAPTCHA-specific heuristics. Our model emphasizes aspects such as data efficiency and compositionality that may be important in the path toward general artificial intelligence.

Entities: Species

Mesh：

Year: 2017 PMID： 29074582 DOI： 10.1126/science.aag2612

Source DB: PubMed Journal: Science ISSN： 0036-8075 Impact factor: 47.728

Keyword Cloud
Cited

13 in total

1. Sensitivity to geometric shape regularity in humans and baboons: A putative signature of human singularity.

Authors: Mathias Sablé-Meyer; Joël Fagot; Serge Caparos; Timo van Kerkoerle; Marie Amalric; Stanislas Dehaene
Journal: Proc Natl Acad Sci U S A Date: 2021-04-20 Impact factor: 11.205

Review 2. Crossing the Cleft: Communication Challenges Between Neuroscience and Artificial Intelligence.

Authors: Frances S Chance; James B Aimone; Srideep S Musuvathy; Michael R Smith; Craig M Vineyard; Felix Wang
Journal: Front Comput Neurosci Date: 2020-05-06 Impact factor: 2.380

3. ToyArchitecture: Unsupervised learning of interpretable models of the environment.

Authors: Jaroslav Vítků; Petr Dluhoš; Joseph Davidson; Matěj Nikl; Simon Andersson; Přemysl Paška; Jan Šinkora; Petr Hlubuček; Martin Stránský; Martin Hyben; Martin Poliak; Jan Feyereisl; Marek Rosa
Journal: PLoS One Date: 2020-05-18 Impact factor: 3.240

4. High-throughput brain activity mapping and machine learning as a foundation for systems neuropharmacology.

Authors: Xudong Lin; Xin Duan; Claire Jacobs; Jeremy Ullmann; Chung-Yuen Chan; Siya Chen; Shuk-Han Cheng; Wen-Ning Zhao; Annapurna Poduri; Xin Wang; Stephen J Haggarty; Peng Shi
Journal: Nat Commun Date: 2018-12-03 Impact factor: 14.919

5. Accurate, fast, data efficient and interpretable glaucoma diagnosis with automated spatial analysis of the whole cup to disc profile.

Authors: Ian J C MacCormick; Bryan M Williams; Yalin Zheng; Kun Li; Baidaa Al-Bander; Silvester Czanner; Rob Cheeseman; Colin E Willoughby; Emery N Brown; George L Spaeth; Gabriela Czanner
Journal: PLoS One Date: 2019-01-10 Impact factor: 3.240

A generative vision model that trains with high data efficiency and breaks text-based CAPTCHAs.

1. Sensitivity to geometric shape regularity in humans and baboons: A putative signature of human singularity.

Review 2. Crossing the Cleft: Communication Challenges Between Neuroscience and Artificial Intelligence.

3. ToyArchitecture: Unsupervised learning of interpretable models of the environment.

4. High-throughput brain activity mapping and machine learning as a foundation for systems neuropharmacology.

5. Accurate, fast, data efficient and interpretable glaucoma diagnosis with automated spatial analysis of the whole cup to disc profile.

6. Recurrence is required to capture the representational dynamics of the human visual system.

7. Clone-structured graph representations enable flexible learning and vicarious evaluation of cognitive maps.

8. Sharpening of Hierarchical Visual Feature Representations of Blurred Images.

9. Awareness as inference in a higher-order state space.

10. From CAPTCHA to Commonsense: How Brain Can Teach Us About Artificial Intelligence.