Literature DB >> 26353063

Scalable Nearest Neighbor Algorithms for High Dimensional Data.

Marius Muja, David G Lowe.   

Abstract

For many computer vision and machine learning problems, large training sets are key for good performance. However, the most computationally expensive part of many computer vision and machine learning algorithms consists of finding nearest neighbor matches to high dimensional vectors that represent the training data. We propose new algorithms for approximate nearest neighbor matching and evaluate and compare them with previous algorithms. For matching high dimensional features, we find two algorithms to be the most efficient: the randomized k-d forest and a new algorithm proposed in this paper, the priority search k-means tree. We also propose a new algorithm for matching binary features by searching multiple hierarchical clustering trees and show it outperforms methods typically used in the literature. We show that the optimal nearest neighbor algorithm and its parameters depend on the data set characteristics and describe an automated configuration procedure for finding the best algorithm to search a particular data set. In order to scale to very large data sets that would otherwise not fit in the memory of a single machine, we propose a distributed nearest neighbor matching framework that can be used with any of the algorithms described in the paper. All this research has been released as an open source library called fast library for approximate nearest neighbors (FLANN), which has been incorporated into OpenCV and is now one of the most popular libraries for nearest neighbor matching.

Entities:  

Year:  2014        PMID: 26353063     DOI: 10.1109/TPAMI.2014.2321376

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  32 in total

1.  Pruning strategies for efficient online globally consistent mosaicking in fetoscopy.

Authors:  Marcel Tella-Amo; Loïc Peter; Dzhoshkun I Shakir; Jan Deprest; Danail Stoyanov; Tom Vercauteren; Sebastien Ourselin
Journal:  J Med Imaging (Bellingham)       Date:  2019-08-07

2.  Instance Search Retrospective with Focus on TRECVID.

Authors:  George Awad; Wessel Kraaij; Paul Over; Shin'ichi Satoh
Journal:  Int J Multimed Inf Retr       Date:  2017-02-22

3.  NearTree, a data structure and a software toolkit for the nearest-neighbor problem.

Authors:  Lawrence C Andrews; Herbert J Bernstein
Journal:  J Appl Crystallogr       Date:  2016-04-12       Impact factor: 3.304

4.  Phantomless Auto-Calibration and Online Calibration Assessment for a Tracked Freehand 2-D Ultrasound Probe.

Authors:  Matthew Toews; William M Wells
Journal:  IEEE Trans Med Imaging       Date:  2017-09-11       Impact factor: 10.048

5.  Robust continuous clustering.

Authors:  Sohil Atul Shah; Vladlen Koltun
Journal:  Proc Natl Acad Sci U S A       Date:  2017-08-29       Impact factor: 11.205

6.  A Likelihood-Free Approach for Characterizing Heterogeneous Diseases in Large-Scale Studies.

Authors:  Jenna Schabdach; William M Wells; Michael Cho; Kayhan N Batmanghelich
Journal:  Inf Process Med Imaging       Date:  2017-05-23

7.  Active Sensing for Continuous State and Action Spaces via Task-Action Entropy Minimization.

Authors:  Tipakorn Greigarn; M Cenk Çavuşoğlu
Journal:  Rep U S       Date:  2016-12-01

8.  MOLIERE: Automatic Biomedical Hypothesis Generation System.

Authors:  Justin Sybrandt; Michael Shtutman; Ilya Safro
Journal:  KDD       Date:  2017-08

9.  Acceleration and Parallelization of ZENO/Walk-on-Spheres.

Authors:  Derek Juba; Walid Keyrouz; Michael Mascagni; Mary Brady
Journal:  Procedia Comput Sci       Date:  2016

10.  A Feature-Based Approach to Big Data Analysis of Medical Images.

Authors:  Matthew Toews; Christian Wachinger; Raul San Jose Estepar; William M Wells
Journal:  Inf Process Med Imaging       Date:  2015
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.