Literature DB >> 17356204

Sharing visual features for multiclass and multiview object detection.

Antonio Torralba1, Kevin P Murphy, William T Freeman.   

Abstract

We consider the problem of detecting a large number of different classes of objects in cluttered scenes. Traditional approaches require applying a battery of different classifiers to the image, at multiple locations and scales. This can be slow and can require a lot of training data since each classifier requires the computation of many different image features. In particular, for independently trained detectors, the (runtime) computational complexity and the (training-time) sample complexity scale linearly with the number of classes to be detected. We present a multitask learning procedure, based on boosted decision stumps, that reduces the computational and sample complexity by finding common features that can be shared across the classes (and/or views). The detectors for each class are trained jointly, rather than independently. For a given performance level, the total number of features required and, therefore, the runtime cost of the classifier, is observed to scale approximately logarithmically with the number of classes. The features selected by joint training are generic edge-like features, whereas the features chosen by training each class separately tend to be more object-specific. The generic features generalize better and considerably reduce the computational cost of multiclass object detection.

Mesh:

Year:  2007        PMID: 17356204     DOI: 10.1109/TPAMI.2007.1055

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  11 in total

1.  Scoring diverse cellular morphologies in image-based screens with iterative feedback and machine learning.

Authors:  Thouis R Jones; Anne E Carpenter; Michael R Lamprecht; Jason Moffat; Serena J Silver; Jennifer K Grenier; Adam B Castoreno; Ulrike S Eggert; David E Root; Polina Golland; David M Sabatini
Journal:  Proc Natl Acad Sci U S A       Date:  2009-02-02       Impact factor: 11.205

2.  Generalization between canonical and non-canonical views in object recognition.

Authors:  Tandra Ghose; Zili Liu
Journal:  J Vis       Date:  2013-01-02       Impact factor: 2.240

3.  Modeling Search for People in 900 Scenes: A combined source model of eye guidance.

Authors:  Krista A Ehinger; Barbara Hidalgo-Sotelo; Antonio Torralba; Aude Oliva
Journal:  Vis cogn       Date:  2009-08-01

4.  Passive and In-situ Assessment of Mental and Physical Well-being using Mobile Sensors.

Authors:  Mashfiqui Rabbi; Shahid Ali; Tanzeem Choudhury; Ethan Berke
Journal:  Proc ACM Int Conf Ubiquitous Comput       Date:  2011

5.  A comparative study of cell classifiers for image-based high-throughput screening.

Authors:  Syed Saiden Abbas; Tjeerd M H Dijkstra; Tom Heskes
Journal:  BMC Bioinformatics       Date:  2014-10-21       Impact factor: 3.169

6.  Two-Layered Graph-Cuts-Based Classification of LiDAR Data in Urban Areas.

Authors:  Yetao Yang; Ke Wu; Yi Wang; Tao Chen; Xiang Wang
Journal:  Sensors (Basel)       Date:  2019-10-28       Impact factor: 3.576

7.  Distributional learning of appearance.

Authors:  Lewis D Griffin; M Husni Wahab; Andrew J Newell
Journal:  PLoS One       Date:  2013-02-27       Impact factor: 3.240

8.  Toward a unified model of face and object recognition in the human visual system.

Authors:  Guy Wallis
Journal:  Front Psychol       Date:  2013-08-15

9.  Small infrared target detection by region-adaptive clutter rejection for sea-based infrared search and track.

Authors:  Sungho Kim; Joohyoung Lee
Journal:  Sensors (Basel)       Date:  2014-07-22       Impact factor: 3.576

10.  Robust Ground Target Detection by SAR and IR Sensor Fusion Using Adaboost-Based Feature Selection.

Authors:  Sungho Kim; Woo-Jin Song; So-Hyun Kim
Journal:  Sensors (Basel)       Date:  2016-07-19       Impact factor: 3.576

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.