Aggregating local image descriptors into compact codes.

Hervé Jégou, Florent Perronnin, Matthijs Douze, Jorge Sánchez, Patrick Pérez, Cordelia Schmid.

Abstract

This paper addresses the problem of large-scale image search. Three constraints have to be taken into account: search accuracy, efficiency, and memory usage. We first present and evaluate different ways of aggregating local image descriptors into a vector and show that the Fisher kernel achieves better performance than the reference bag-of-visual words approach for any given vector dimension. We then jointly optimize dimensionality reduction and indexing in order to obtain a precise vector comparison as well as a compact representation. The evaluation shows that the image representation can be reduced to a few dozen bytes while preserving high accuracy. Searching a 100 million image data set takes about 250 ms on one processor core.
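As an illustration of what "aggregating local image descriptors into a vector" means, the following is a minimal sketch of the reference bag-of-visual-words baseline only; it is not the Fisher kernel representation and not the authors' implementation. The function names, the toy centroids, and the 2-D descriptors are hypothetical; a real system would use high-dimensional descriptors and a codebook learned by clustering.

```python
import math


def nearest_centroid(descriptor, centroids):
    """Return the index of the centroid closest to the descriptor (squared L2)."""
    best_index, best_dist = 0, float("inf")
    for i, centroid in enumerate(centroids):
        dist = sum((x - y) ** 2 for x, y in zip(descriptor, centroid))
        if dist < best_dist:
            best_index, best_dist = i, dist
    return best_index


def bow_vector(descriptors, centroids):
    """Aggregate local descriptors into an L2-normalized visual-word histogram."""
    hist = [0.0] * len(centroids)
    for descriptor in descriptors:
        hist[nearest_centroid(descriptor, centroids)] += 1.0
    norm = math.sqrt(sum(h * h for h in hist))
    # Leave an all-zero histogram unchanged to avoid dividing by zero.
    return [h / norm for h in hist] if norm > 0 else hist


# Toy codebook of three visual words and four local descriptors (assumed values).
centroids = [[0.0, 0.0], [1.0, 1.0], [5.0, 5.0]]
descriptors = [[0.1, 0.0], [0.9, 1.2], [1.1, 0.8], [4.8, 5.1]]

vec = bow_vector(descriptors, centroids)
print(vec)
```

The resulting vector has one dimension per visual word, so two images can be compared with a single dot product. The abstract's memory constraint can be read the same way: at an assumed 32 bytes per image (an illustrative figure within "a few dozen bytes"), 100 million images would occupy about 3.2 GB.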

Year:  2012        PMID: 22156101     DOI: 10.1109/TPAMI.2011.235

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0162-8828            Impact factor:   6.226


  22 in total

1.  Instance Search Retrospective with Focus on TRECVID.

Authors:  George Awad; Wessel Kraaij; Paul Over; Shin'ichi Satoh
Journal:  Int J Multimed Inf Retr       Date:  2017-02-22

2.  Dictionary Pruning with Visual Word Significance for Medical Image Retrieval.

Authors:  Fan Zhang; Yang Song; Weidong Cai; Alexander G Hauptmann; Sidong Liu; Sonia Pujol; Ron Kikinis; Michael J Fulham; David Dagan Feng; Mei Chen
Journal:  Neurocomputing       Date:  2015-11-17       Impact factor: 5.719

3.  Computational Analysis of Cell Dynamics in Videos with Hierarchical-Pooled Deep-Convolutional Features.

Authors:  Fengqian Pang; Heng Li; Yonggang Shi; Zhiwen Liu
Journal:  J Comput Biol       Date:  2018-04-25       Impact factor: 1.479

4.  Low-level contrast statistics are diagnostic of invariance of natural textures.

Authors:  Iris I A Groen; Sennay Ghebreab; Victor A F Lamme; H Steven Scholte
Journal:  Front Comput Neurosci       Date:  2012-06-08       Impact factor: 2.380

5.  Encoder-Decoder Full Residual Deep Networks for Robust Regression and Spatiotemporal Estimation.

Authors:  Lianfa Li; Ying Fang; Jun Wu; Jinfeng Wang; Yong Ge
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2021-08-31       Impact factor: 14.255

6.  A Probabilistic Analysis of Sparse Coded Feature Pooling and Its Application for Image Retrieval.

Authors:  Yunchao Zhang; Jing Chen; Xiujie Huang; Yongtian Wang
Journal:  PLoS One       Date:  2015-07-01       Impact factor: 3.240

7.  Automatic Recognition of Fetal Facial Standard Plane in Ultrasound Image via Fisher Vector.

Authors:  Baiying Lei; Ee-Leng Tan; Siping Chen; Liu Zhuo; Shengli Li; Dong Ni; Tianfu Wang
Journal:  PLoS One       Date:  2015-05-01       Impact factor: 3.240

8.  Visual dictionaries as intermediate features in the human brain.

Authors:  Kandan Ramakrishnan; H Steven Scholte; Iris I A Groen; Arnold W M Smeulders; Sennay Ghebreab
Journal:  Front Comput Neurosci       Date:  2015-01-15       Impact factor: 2.380

9.  Discriminative Learning for Alzheimer's Disease Diagnosis via Canonical Correlation Analysis and Multimodal Fusion.

Authors:  Baiying Lei; Siping Chen; Dong Ni; Tianfu Wang
Journal:  Front Aging Neurosci       Date:  2016-05-17       Impact factor: 5.750

10.  Discriminative Learning for Automatic Staging of Placental Maturity via Multi-layer Fisher Vector.

Authors:  Baiying Lei; Yuan Yao; Siping Chen; Shengli Li; Wanjun Li; Dong Ni; Tianfu Wang
Journal:  Sci Rep       Date:  2015-07-31       Impact factor: 4.379
