Fast and Accurate Least-Mean-Squares Solvers for High Dimensional Data.

Alaa Maalouf, Ibrahim Jubran, Danny Feldman.   

Abstract

Least-mean-squares (LMS) solvers such as Linear/Ridge Regression and SVD not only solve fundamental machine learning problems, but are also the building blocks in a variety of other methods, such as matrix factorizations. We suggest an algorithm that gets a finite set of n d-dimensional real vectors and returns a subset of d+1 vectors with positive weights whose weighted sum is exactly the same. The constructive proof of Caratheodory's Theorem computes such a subset in O(n^2 d^2) time and is thus not used in practice. Our algorithm computes this subset in O(nd + d^4 log n) time, using O(log n) calls to Caratheodory's construction on small but "smart" subsets. This is based on a novel paradigm of fusion between different data summarization techniques, known as sketches and coresets. For large values of d, we suggest a faster construction that takes O(nd) time and returns a weighted subset of O(d) sparsified input points. Here, a sparsified point means that some of its entries were set to zero. As an application, we show how to boost the performance of existing LMS solvers, such as those in the scikit-learn library, by up to 100x. Generalization to streaming and distributed data is trivial. Extensive experimental results and open source code are provided.
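
For readers unfamiliar with the construction the abstract refers to, the following is a minimal sketch of the classic (slow) Caratheodory reduction, not the authors' faster algorithm. It assumes NumPy, non-negative input weights, and a hypothetical function name `caratheodory`. Each pass takes d+2 of the remaining points, finds a vector v with sum(v_i) = 0 and sum(v_i * p_i) = 0, and shifts the weights along v until one of them reaches zero, which leaves both the weighted sum and the total weight unchanged.

```python
import numpy as np

def caratheodory(points, weights):
    """Classic Caratheodory reduction (illustrative sketch, not the paper's method).

    Returns a weight vector of the same length as `weights` with at most
    d+1 non-zero entries, the same weighted sum of points, and the same
    total weight. Assumes all input weights are non-negative.
    """
    points = np.asarray(points, dtype=float)
    w = np.asarray(weights, dtype=float).copy()
    d = points.shape[1]
    active = np.flatnonzero(w > 0)
    while active.size > d + 1:
        idx = active[: d + 2]  # any d+2 points in R^d are affinely dependent
        diffs = (points[idx[1:]] - points[idx[0]]).T  # shape d x (d+1)
        # The last right-singular vector of a d x (d+1) matrix lies in its null space.
        _, _, vt = np.linalg.svd(diffs)
        tail = vt[-1]
        # Prepend the coefficient of the reference point so that sum(v) == 0
        # and sum(v_i * p_i) == 0 over the chosen d+2 points.
        v = np.concatenate(([-tail.sum()], tail))
        pos = v > 0
        ratios = w[idx][pos] / v[pos]
        k = np.argmin(ratios)
        w[idx] -= ratios[k] * v       # largest step that keeps every weight >= 0
        w[idx[pos][k]] = 0.0          # this weight is zero by construction
        w[idx] = np.maximum(w[idx], 0.0)  # clip tiny negative rounding errors
        active = np.flatnonzero(w > 0)
    return w
```

Calling `caratheodory(P, u)` on an n x d array `P` with uniform weights `u` yields a sparse weight vector `w` for which `w @ P` matches `u @ P` up to floating-point error. This sketch removes one point per pass and makes no attempt to match the running times quoted in the abstract.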

Year:  2022        PMID: 35015632     DOI: 10.1109/TPAMI.2021.3139612

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


