Literature DB >> 21552465

Spectral Regularization Algorithms for Learning Large Incomplete Matrices.

Rahul Mazumder1, Trevor Hastie, Robert Tibshirani.   

Abstract

We use convex relaxation techniques to provide a sequence of regularized low-rank solutions for large-scale matrix completion problems. Using the nuclear norm as a regularizer, we provide a simple and very efficient convex algorithm for minimizing the reconstruction error subject to a bound on the nuclear norm. Our algorithm Soft-Impute iteratively replaces the missing elements with those obtained from a soft-thresholded SVD. With warm starts this allows us to efficiently compute an entire regularization path of solutions on a grid of values of the regularization parameter. The computationally intensive part of our algorithm is in computing a low-rank SVD of a dense matrix. Exploiting the problem structure, we show that the task can be performed with a complexity linear in the matrix dimensions. Our semidefinite-programming algorithm is readily scalable to large matrices: for example it can obtain a rank-80 approximation of a 10(6) × 10(6) incomplete matrix with 10(5) observed entries in 2.5 hours, and can fit a rank 40 approximation to the full Netflix training set in 6.6 hours. Our methods show very good performance both in training and test error when compared to other competitive state-of-the art techniques.

Entities:  

Year:  2010        PMID: 21552465      PMCID: PMC3087301     

Source DB:  PubMed          Journal:  J Mach Learn Res        ISSN: 1532-4435            Impact factor:   3.654


  1 in total

1.  Missing value estimation methods for DNA microarrays.

Authors:  O Troyanskaya; M Cantor; G Sherlock; P Brown; T Hastie; R Tibshirani; D Botstein; R B Altman
Journal:  Bioinformatics       Date:  2001-06       Impact factor: 6.937

  1 in total
  77 in total

1.  A New Multi-Atlas Registration Framework for Multimodal Pathological Images Using Conventional Monomodal Normal Atlases.

Authors:  Zhenyu Tang; Pew-Thian Yap; Dinggang Shen
Journal:  IEEE Trans Image Process       Date:  2018-12-17       Impact factor: 10.856

2.  Structured Matrix Completion with Applications to Genomic Data Integration.

Authors:  Tianxi Cai; T Tony Cai; Anru Zhang
Journal:  J Am Stat Assoc       Date:  2016-08-18       Impact factor: 5.033

3.  Accounting for non-genetic factors by low-rank representation and sparse regression for eQTL mapping.

Authors:  Can Yang; Lin Wang; Shuqin Zhang; Hongyu Zhao
Journal:  Bioinformatics       Date:  2013-02-17       Impact factor: 6.937

4.  Next Generation Statistical Genetics: Modeling, Penalization, and Optimization in High-Dimensional Data.

Authors:  Kenneth Lange; Jeanette C Papp; Janet S Sinsheimer; Eric M Sobel
Journal:  Annu Rev Stat Appl       Date:  2014-01-01       Impact factor: 5.810

5.  Integrative factorization of bidimensionally linked matrices.

Authors:  Jun Young Park; Eric F Lock
Journal:  Biometrics       Date:  2019-11-10       Impact factor: 2.571

6.  Similarity-based Regularized Latent Feature Model for Link Prediction in Bipartite Networks.

Authors:  Wenjun Wang; Xue Chen; Pengfei Jiao; Di Jin
Journal:  Sci Rep       Date:  2017-12-05       Impact factor: 4.379

7.  MISSING DATA IMPUTATION IN THE ELECTRONIC HEALTH RECORD USING DEEPLY LEARNED AUTOENCODERS.

Authors:  Brett K Beaulieu-Jones; Jason H Moore
Journal:  Pac Symp Biocomput       Date:  2017

8.  Interpretation of machine learning predictions for patient outcomes in electronic health records.

Authors:  William La Cava; Christopher Bauer; Jason H Moore; Sarah A Pendergrass
Journal:  AMIA Annu Symp Proc       Date:  2020-03-04

9.  Imputation Strategy for Reliable Regional MRI Morphological Measurements.

Authors:  Shaina Sta Cruz; Ivo D Dinov; Megan M Herting; Clio González-Zacarías; Hosung Kim; Arthur W Toga; Farshid Sepehrband
Journal:  Neuroinformatics       Date:  2020-01

10.  Multisample aCGH data analysis via total variation and spectral regularization.

Authors:  Xiaowei Zhou; Can Yang; Xiang Wan; Hongyu Zhao; Weichuan Yu
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2013 Jan-Feb       Impact factor: 3.710

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.