Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Towards artificial general intelligence via a multimodal foundation model.

Literature DB >> 35655064

Towards artificial general intelligence via a multimodal foundation model.

Nanyi Fei^1,2,3, Zhiwu Lu^4,5, Yizhao Gao^1,2, Guoxing Yang^1,2, Yuqi Huo^2,3, Jingyuan Wen^1,2, Haoyu Lu^1,2, Ruihua Song^1,2, Xin Gao⁶, Tao Xiang⁷, Hao Sun^8,9, Ji-Rong Wen^10,11,12.

Abstract

The fundamental goal of artificial intelligence (AI) is to mimic the core cognitive activities of human. Despite tremendous success in the AI research, most of existing methods have only single-cognitive ability. To overcome this limitation and take a solid step towards artificial general intelligence (AGI), we develop a foundation model pre-trained with huge multimodal data, which can be quickly adapted for various downstream cognitive tasks. To achieve this goal, we propose to pre-train our foundation model by self-supervised learning with weak semantic correlation data crawled from the Internet and show that promising results can be obtained on a wide range of downstream tasks. Particularly, with the developed model-interpretability tools, we demonstrate that strong imagination ability is now possessed by our foundation model. We believe that our work makes a transformative stride towards AGI, from our common practice of "weak or narrow AI" to that of "strong or generalized AI".

Entities: Chemical

Mesh：

Year: 2022 PMID： 35655064 PMCID： PMC9163040 DOI： 10.1038/s41467-022-30761-2

Source DB: PubMed Journal: Nat Commun ISSN： 2041-1723 Impact factor: 17.694

Keyword Cloud
References

5 in total

Review 1. Deep learning.

Authors: Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal: Nature Date: 2015-05-28 Impact factor: 49.962

2. Zero and Few Shot Learning With Semantic Feature Synthesis and Competitive Learning.

Authors: Jiechao Guan; Zhiwu Lu; Tao Xiang; Aoxue Li; An Zhao; Ji-Rong Wen
Journal: IEEE Trans Pattern Anal Mach Intell Date: 2021-06-08 Impact factor: 6.226

3. Two Stream Active Query Suggestion for Active Learning in Connectomics.

Authors: Zudi Lin; Donglai Wei; Won-Dong Jang; Siyan Zhou; Xupeng Chen; Xueying Wang; Richard Schalek; Daniel Berger; Brian Matejek; Lee Kamentsky; Adi Peleg; Daniel Haehn; Thouis Jones; Toufiq Parag; Jeff Lichtman; Hanspeter Pfister
Journal: Comput Vis ECCV Date: 2020-12-04

4. Invariant visual representation by single neurons in the human brain.

Authors: R Quian Quiroga; L Reddy; G Kreiman; C Koch; I Fried
Journal: Nature Date: 2005-06-23 Impact factor: 49.962

5. Explicit encoding of multimodal percepts by single neurons in the human brain.

Authors: Rodrigo Quian Quiroga; Alexander Kraskov; Christof Koch; Itzhak Fried
Journal: Curr Biol Date: 2009-07-23 Impact factor: 10.834

5 in total