Literature DB >> 18276975

Scene classification using a hybrid generative/discriminative approach.

Anna Bosch1, Andrew Zisserman, Xavier Muñoz.   

Abstract

We investigate whether dimensionality reduction using a latent generative model is beneficial for the task of weakly supervised scene classification. In detail we are given a set of labelled images of scenes (e.g. coast, forest, city, river, etc) and our objective is to classify a new image into one of these categories. Our approach consists of first discovering latent "topics" using probabilistic Latent Semantic Analysis (pLSA), a generative model from the statistical text literature here applied to a bag of visual words representation for each image, and subsequently training a multi-way classifier on the topic distribution vector for each image. We compare this approach to that of representing each image by a bag of visual words vector directly, and training a multi-way classifier on these vectors. To this end we introduce a novel vocabulary using dense colour SIFT descriptors, and then investigate the classification performance under changes in the size of the visual vocabulary, the number of latent topics learnt, and the type of discriminative classifier used (k-nearest neighbour or SVM). We achieve superior classification performance to recent publications that have used a bag of visual word representation, in all cases using the authors' own datasets and testing protocols. We also investigate the gain in adding spatial information. We show applications to image retrieval with relevance feedback and to scene classification in videos.

Entities:  

Mesh:

Year:  2008        PMID: 18276975     DOI: 10.1109/TPAMI.2007.70716

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  11 in total

1.  Pairwise Latent Semantic Association for Similarity Computation in Medical Imaging.

Authors:  Fan Zhang; Yang Song; Weidong Cai; Sidong Liu; Siqi Liu; Sonia Pujol; Ron Kikinis; Yong Xia; Michael J Fulham; David Dagan Feng
Journal:  IEEE Trans Biomed Eng       Date:  2015-09-10       Impact factor: 4.538

2.  Classification of Tumor Histology via Morphometric Context.

Authors:  Hang Chang; Alexander Borowsky; Paul Spellman; Bahram Parvin
Journal:  Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit       Date:  2013-06-23

3.  Generative-discriminative basis learning for medical imaging.

Authors:  Nematollah K Batmanghelich; Ben Taskar; Christos Davatzikos
Journal:  IEEE Trans Med Imaging       Date:  2011-07-25       Impact factor: 10.048

4.  Stacked Predictive Sparse Decomposition for Classification of Histology Sections.

Authors:  Hang Chang; Yin Zhou; Alexander Borowsky; Kenneth Barner; Paul Spellman; Bahram Parvin
Journal:  Int J Comput Vis       Date:  2014-12-23       Impact factor: 7.410

5.  Stacked Predictive Sparse Coding for Classification of Distinct Regions of Tumor Histopathology.

Authors:  Hang Chang; Yin Zhou; Paul Spellman; Bahram Parvin
Journal:  Proc IEEE Int Conf Comput Vis       Date:  2013

6.  Modeling Search for People in 900 Scenes: A combined source model of eye guidance.

Authors:  Krista A Ehinger; Barbara Hidalgo-Sotelo; Antonio Torralba; Aude Oliva
Journal:  Vis cogn       Date:  2009-08-01

7.  Dictionary Pruning with Visual Word Significance for Medical Image Retrieval.

Authors:  Fan Zhang; Yang Song; Weidong Cai; Alexander G Hauptmann; Sidong Liu; Sonia Pujol; Ron Kikinis; Michael J Fulham; David Dagan Feng; Mei Chen
Journal:  Neurocomputing       Date:  2015-11-17       Impact factor: 5.719

8.  Generative embedding for model-based classification of fMRI data.

Authors:  Kay H Brodersen; Thomas M Schofield; Alexander P Leff; Cheng Soon Ong; Ekaterina I Lomakina; Joachim M Buhmann; Klaas E Stephan
Journal:  PLoS Comput Biol       Date:  2011-06-23       Impact factor: 4.475

9.  Subjective Ratings of Beauty and Aesthetics: Correlations With Statistical Image Properties in Western Oil Paintings.

Authors:  Gregor U Hayn-Leichsenring; Thomas Lehmann; Christoph Redies
Journal:  Iperception       Date:  2017-06-28

10.  An efficient image descriptor for image classification and CBIR.

Authors:  Ashkan Shakarami; Hadis Tarrah
Journal:  Optik (Stuttg)       Date:  2020-05-04       Impact factor: 2.443

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.