Literature DB >> 30602420

Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs.

Yu A Malkov, D A Yashunin.   

Abstract

We present a new approach for the approximate K-nearest neighbor search based on navigable small world graphs with controllable hierarchy (Hierarchical NSW, HNSW). The proposed solution is fully graph-based, without any need for additional search structures (typically used at the coarse search stage of the most proximity graph techniques). Hierarchical NSW incrementally builds a multi-layer structure consisting of a hierarchical set of proximity graphs (layers) for nested subsets of the stored elements. The maximum layer in which an element is present is selected randomly with an exponentially decaying probability distribution. This allows producing graphs similar to the previously studied Navigable Small World (NSW) structures while additionally having the links separated by their characteristic distance scales. Starting the search from the upper layer together with utilizing the scale separation boosts the performance compared to NSW and allows a logarithmic complexity scaling. Additional employment of a heuristic for selecting proximity graph neighbors significantly increases performance at high recall and in case of highly clustered data. Performance evaluation has demonstrated that the proposed general metric space search index is able to strongly outperform previous opensource state-of-the-art vector-only approaches. Similarity of the algorithm to the skip list structure allows straightforward balanced distributed implementation.

Year:  2018        PMID: 30602420     DOI: 10.1109/TPAMI.2018.2889473

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  10 in total

1.  GuideMaker: Software to design CRISPR-Cas guide RNA pools in non-model genomes.

Authors:  Ravin Poudel; Lidimarie Trujillo Rodriguez; Christopher R Reisch; Adam R Rivers
Journal:  Gigascience       Date:  2022-04-01       Impact factor: 6.524

2.  Generalizing RNA velocity to transient cell states through dynamical modeling.

Authors:  Volker Bergen; Marius Lange; Stefan Peidli; F Alexander Wolf; Fabian J Theis
Journal:  Nat Biotechnol       Date:  2020-08-03       Impact factor: 54.908

3.  Cumulus provides cloud-based data analysis for large-scale single-cell and single-nucleus RNA-seq.

Authors:  Bo Li; Joshua Gould; Yiming Yang; Siranush Sarkizova; Marcin Tabaka; Orr Ashenberg; Yanay Rosen; Michal Slyper; Monika S Kowalczyk; Alexandra-Chloé Villani; Timothy Tickle; Nir Hacohen; Orit Rozenblatt-Rosen; Aviv Regev
Journal:  Nat Methods       Date:  2020-07-27       Impact factor: 28.547

4.  Similarity-driven multi-view embeddings from high-dimensional biomedical data.

Authors:  Brian B Avants; Nicholas J Tustison; James R Stone
Journal:  Nat Comput Sci       Date:  2021-02-22

5.  CoCoA-diff: counterfactual inference for single-cell gene expression analysis.

Authors:  Yongjin P Park; Manolis Kellis
Journal:  Genome Biol       Date:  2021-08-17       Impact factor: 13.583

6.  Scarf enables a highly memory-efficient analysis of large-scale single-cell genomics data.

Authors:  Parashar Dhapola; Johan Rodhe; Rasmus Olofzon; Thomas Bonald; Eva Erlandsson; Shamit Soneji; Göran Karlsson
Journal:  Nat Commun       Date:  2022-08-08       Impact factor: 17.694

7.  Machine Learning Characterization of Ictal and Interictal States in EEG Aimed at Automated Seizure Detection.

Authors:  Gaetano Zazzaro; Luigi Pavone
Journal:  Biomedicines       Date:  2022-06-23

8.  Scedar: A scalable Python package for single-cell RNA-seq exploratory data analysis.

Authors:  Yuanchao Zhang; Man S Kim; Erin R Reichenberger; Ben Stear; Deanne M Taylor
Journal:  PLoS Comput Biol       Date:  2020-04-27       Impact factor: 4.475

9.  A multiresolution framework to characterize single-cell state landscapes.

Authors:  Shahin Mohammadi; Jose Davila-Velderrain; Manolis Kellis
Journal:  Nat Commun       Date:  2020-10-26       Impact factor: 14.919

10.  An analytical framework for interpretable and generalizable single-cell data analysis.

Authors:  Jian Zhou; Olga G Troyanskaya
Journal:  Nat Methods       Date:  2021-11-01       Impact factor: 28.547

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.