
Segmentation of multivariate mixed data via lossy data coding and compression.

Yi Ma, Harm Derksen, Wei Hong.

Abstract

In this paper, based on ideas from lossy data coding and compression, we present a simple but effective technique for segmenting multivariate mixed data that are drawn from a mixture of Gaussian distributions, which are allowed to be almost degenerate. The goal is to find the optimal segmentation that minimizes the overall coding length of the segmented data, subject to a given distortion. By analyzing the coding length/rate of mixed data, we formally establish some strong connections of data segmentation to many fundamental concepts in lossy data compression and rate distortion theory. We show that a deterministic segmentation is approximately the (asymptotically) optimal solution for compressing mixed data. We propose a very simple and effective algorithm which depends on a single parameter, the allowable distortion. At any given distortion, the algorithm automatically determines the corresponding number and dimension of the groups and does not involve any parameter estimation. Simulation results reveal intriguing phase-transition-like behaviors of the number of segments when changing the level of distortion or the amount of outliers. Finally, we demonstrate how this technique can be readily applied to segment real imagery and bioinformatic data.
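The idea described in the abstract (pick the segmentation that minimizes total coding length at a fixed distortion) can be illustrated with a minimal Python sketch. The abstract does not state the coding-length formula or the search procedure, so everything below is an assumption: the Gaussian rate-distortion style estimate in `coding_length`, the group-membership cost in `group_cost`, and the greedy pairwise merging in `greedy_segment` are illustrative choices, and all function names are hypothetical. The only tunable input is `eps`, the allowable distortion.

```python
import numpy as np


def coding_length(W, eps):
    """Approximate bits needed to code the columns of W (n x m) up to
    distortion eps**2. ASSUMED formula, not taken from the abstract:
    a log-det rate term for the centered data plus a term for the mean."""
    n, m = W.shape
    mu = W.mean(axis=1, keepdims=True)
    Wc = W - mu
    # slogdet returns (sign, natural-log |det|); the matrix is positive definite.
    _, logdet = np.linalg.slogdet(np.eye(n) + (n / (eps**2 * m)) * (Wc @ Wc.T))
    bits = 0.5 * (m + n) * logdet / np.log(2.0)
    bits += 0.5 * n * np.log2(1.0 + (mu.T @ mu).item() / eps**2)
    return bits


def group_cost(X, idx, eps):
    """Coding length of one group plus the bits needed to label its members
    (ASSUMED membership cost: -log2 of the group's share of the data)."""
    m = X.shape[1]
    return coding_length(X[:, idx], eps) - len(idx) * np.log2(len(idx) / m)


def segmented_length(X, groups, eps):
    """Total coding length of a segmentation given as lists of column indices."""
    return sum(group_cost(X, idx, eps) for idx in groups)


def greedy_segment(X, eps):
    """Start with every sample in its own group and repeatedly merge the pair
    whose merge shortens the total coding length the most; stop when no merge
    helps. The number of groups is an output, not an input."""
    groups = [[j] for j in range(X.shape[1])]
    costs = [group_cost(X, g, eps) for g in groups]
    while len(groups) > 1:
        best_delta, best = 0.0, None
        for a in range(len(groups)):
            for b in range(a + 1, len(groups)):
                merged_cost = group_cost(X, groups[a] + groups[b], eps)
                delta = merged_cost - costs[a] - costs[b]
                if delta < best_delta:
                    best_delta, best = delta, (a, b, merged_cost)
        if best is None:
            break  # no pair of groups can be merged to save bits
        a, b, merged_cost = best
        groups[a] = groups[a] + groups[b]
        costs[a] = merged_cost
        del groups[b], costs[b]
    return groups


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy data: two nearly degenerate (line-like) clusters in the plane.
    t = rng.normal(size=(1, 15))
    A = np.vstack([t, 0.01 * rng.normal(size=(1, 15))])
    B = np.vstack([0.01 * rng.normal(size=(1, 15)), t + 5.0])
    X = np.hstack([A, B])
    for eps in (0.05, 0.5, 5.0):
        print(eps, len(greedy_segment(X, eps)))
```

The loop at the bottom reruns the sketch at several distortion levels, which is one way to observe how the number of segments reacts to `eps`. The pairwise search is cubic in the number of samples, so this form is only suitable for small toy inputs.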

Year:  2007        PMID: 17627043     DOI: 10.1109/TPAMI.2007.1085

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  5 in total

1.  Two-Point Correlation as a Feature for Histology Images: Feature Space Structure and Correlation Updating.

Authors:  Lee Cooper; Joel Saltz; Raghu Machiraju; Kun Huang
Journal:  Conf Comput Vis Pattern Recognit Workshops       Date:  2010

2.  CTRL: Closed-Loop Transcription to an LDR via Minimaxing Rate Reduction.

Authors:  Xili Dai; Shengbang Tong; Mingyang Li; Ziyang Wu; Michael Psenka; Kwan Ho Ryan Chan; Pengyuan Zhai; Yaodong Yu; Xiaojun Yuan; Heung-Yeung Shum; Yi Ma
Journal:  Entropy (Basel)       Date:  2022-03-25       Impact factor: 2.738

3.  Gaussian multiscale aggregation applied to segmentation in hand biometrics.

Authors:  Alberto de Santos Sierra; Carmen Sánchez Avila; Javier Guerra Casanova; Gonzalo Bailador del Pozo
Journal:  Sensors (Basel)       Date:  2011-11-28       Impact factor: 3.576

4.  Identifying subspace gene clusters from microarray data using low-rank representation.

Authors:  Yan Cui; Chun-Hou Zheng; Jian Yang
Journal:  PLoS One       Date:  2013-03-19       Impact factor: 3.240

5.  Track-Before-Detect Framework-Based Vehicle Monocular Vision Sensors.

Authors:  Hernan Gonzalez; Sergio Rodriguez; Abdelhafid Elouardi
Journal:  Sensors (Basel)       Date:  2019-01-29       Impact factor: 3.576
