Literature DB >> 35756358

Multi-scale affinities with missing data: Estimation and applications.

Min Zhang1, Gal Mishne2, Eric C Chi3.   

Abstract

Many machine learning algorithms depend on weights that quantify row and column similarities of a data matrix. The choice of weights can dramatically impact the effectiveness of the algorithm. Nonetheless, the problem of choosing weights has arguably not been given enough study. When a data matrix is completely observed, Gaussian kernel affinities can be used to quantify the local similarity between pairs of rows and pairs of columns. Computing weights in the presence of missing data, however, becomes challenging. In this paper, we propose a new method to construct row and column affinities even when data are missing by building off a co-clustering technique. This method takes advantage of solving the optimization problem for multiple pairs of cost parameters and filling in the missing values with increasingly smooth estimates. It exploits the coupled similarity structure among both the rows and columns of a data matrix. We show these affinities can be used to perform tasks such as data imputation, clustering, and matrix completion on graphs.

Entities:  

Keywords:  kernels; missing data; penalized estimation

Year:  2021        PMID: 35756358      PMCID: PMC9216212          DOI: 10.1002/sam.11561

Source DB:  PubMed          Journal:  Stat Anal Data Min        ISSN: 1932-1864            Impact factor:   1.247


  11 in total

1.  Biclustering via sparse singular value decomposition.

Authors:  Mihee Lee; Haipeng Shen; Jianhua Z Huang; J S Marron
Journal:  Biometrics       Date:  2010-12       Impact factor: 2.571

2.  Objective Automatic Assessment of Rehabilitative Speech Treatment in Parkinson's Disease.

Authors:  Athanasios Tsanas; Max A Little; Cynthia Fox; Lorraine O Ramig
Journal:  IEEE Trans Neural Syst Rehabil Eng       Date:  2014-01       Impact factor: 3.802

3.  Multiway Graph Signal Processing on Tensors: Integrative analysis of irregular geometries.

Authors:  Jay S Stanley; Eric C Chi; Gal Mishne
Journal:  IEEE Signal Process Mag       Date:  2020-10-29       Impact factor: 12.551

4.  Data-Driven Tree Transforms and Metrics.

Authors:  Gal Mishne; Ronen Talmon; Israel Cohen; Ronald R Coifman; Yuval Kluger
Journal:  IEEE Trans Signal Inf Process Netw       Date:  2017-08-23

5.  Image processing using smooth ordering of its patches.

Authors:  Idan Ram; Michael Elad; Israel Cohen
Journal:  IEEE Trans Image Process       Date:  2013-04-12       Impact factor: 10.856

6.  Convex biclustering.

Authors:  Eric C Chi; Genevera I Allen; Richard G Baraniuk
Journal:  Biometrics       Date:  2016-05-10       Impact factor: 2.571

7.  Spectral Regularization Algorithms for Learning Large Incomplete Matrices.

Authors:  Rahul Mazumder; Trevor Hastie; Robert Tibshirani
Journal:  J Mach Learn Res       Date:  2010-03-01       Impact factor: 3.654

8.  COBRAC: a fast implementation of convex biclustering with compression.

Authors:  Haidong Yi; Le Huang; Gal Mishne; Eric C Chi
Journal:  Bioinformatics       Date:  2021-04-27       Impact factor: 6.937

9.  Self-Organizing Feature Maps Identify Proteins Critical to Learning in a Mouse Model of Down Syndrome.

Authors:  Clara Higuera; Katheleen J Gardiner; Krzysztof J Cios
Journal:  PLoS One       Date:  2015-06-25       Impact factor: 3.240

10.  Optimal clustering with missing values.

Authors:  Shahin Boluki; Siamak Zamani Dadaneh; Xiaoning Qian; Edward R Dougherty
Journal:  BMC Bioinformatics       Date:  2019-06-20       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.