Literature DB >> 18785855

Nonnegative matrix factorization with the Itakura-Saito divergence: with application to music analysis.

Cédric Févotte1, Nancy Bertin, Jean-Louis Durrieu.   

Abstract

This letter presents theoretical, algorithmic, and experimental results about nonnegative matrix factorization (NMF) with the Itakura-Saito (IS) divergence. We describe how IS-NMF is underlaid by a well-defined statistical model of superimposed gaussian components and is equivalent to maximum likelihood estimation of variance parameters. This setting can accommodate regularization constraints on the factors through Bayesian priors. In particular, inverse-gamma and gamma Markov chain priors are considered in this work. Estimation can be carried out using a space-alternating generalized expectation-maximization (SAGE) algorithm; this leads to a novel type of NMF algorithm, whose convergence to a stationary point of the IS cost function is guaranteed. We also discuss the links between the IS divergence and other cost functions used in NMF, in particular, the Euclidean distance and the generalized Kullback-Leibler (KL) divergence. As such, we describe how IS-NMF can also be performed using a gradient multiplicative algorithm (a standard algorithm structure in NMF) whose convergence is observed in practice, though not proven. Finally, we report a furnished experimental comparative study of Euclidean-NMF, KL-NMF, and IS-NMF algorithms applied to the power spectrogram of a short piano sequence recorded in real conditions, with various initializations and model orders. Then we show how IS-NMF can successfully be employed for denoising and upmix (mono to stereo conversion) of an original piece of early jazz music. These experiments indicate that IS-NMF correctly captures the semantics of audio and is better suited to the representation of music signals than NMF with the usual Euclidean and KL costs.

Entities:  

Mesh:

Year:  2009        PMID: 18785855     DOI: 10.1162/neco.2008.04-08-771

Source DB:  PubMed          Journal:  Neural Comput        ISSN: 0899-7667            Impact factor:   2.026


  17 in total

1.  Estimating nonnegative matrix model activations with deep neural networks to increase perceptual speech quality.

Authors:  Donald S Williamson; Yuxuan Wang; DeLiang Wang
Journal:  J Acoust Soc Am       Date:  2015-09       Impact factor: 1.840

2.  Improved Convolutive and Under-Determined Blind Audio Source Separation with MRF Smoothing.

Authors:  Rafał Zdunek
Journal:  Cognit Comput       Date:  2012-09-07       Impact factor: 5.418

Review 3.  Modelling and analysis of local field potentials for studying the function of cortical circuits.

Authors:  Gaute T Einevoll; Christoph Kayser; Nikos K Logothetis; Stefano Panzeri
Journal:  Nat Rev Neurosci       Date:  2013-11       Impact factor: 34.870

4.  Rectified Gaussian Scale Mixtures and the Sparse Non-Negative Least Squares Problem.

Authors:  Alican Nalci; Igor Fedorov; Maher Al-Shoukairi; Thomas T Liu; Bhaskar D Rao
Journal:  IEEE Trans Signal Process       Date:  2018-04-06       Impact factor: 4.931

5.  Directed Spectral Measures Improve Latent Network Models Of Neural Populations.

Authors:  Neil M Gallagher; Kafui Dzirasa; David Carlson
Journal:  Adv Neural Inf Process Syst       Date:  2021-12

6.  Acoustic Denoising using Dictionary Learning with Spectral and Temporal Regularization.

Authors:  Colin Vaz; Vikram Ramanarayanan; Shrikanth Narayanan
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2018-01-31

7.  A unified framework for sparse non-negative least squares using multiplicative updates and the non-negative matrix factorization problem.

Authors:  Igor Fedorov; Alican Nalci; Ritwik Giri; Bhaskar D Rao; Truong Q Nguyen; Harinath Garudadri
Journal:  Signal Processing       Date:  2018-01-06       Impact factor: 4.662

8. 

Authors:  Robert Peharz; Franz Pernkopf
Journal:  Neurocomputing       Date:  2012-03-15       Impact factor: 5.719

9.  Development of a real time sparse non-negative matrix factorization module for cochlear implants by using xPC target.

Authors:  Hongmei Hu; Agamemnon Krasoulis; Mark Lutman; Stefan Bleeck
Journal:  Sensors (Basel)       Date:  2013-10-14       Impact factor: 3.576

10.  Tracking Time Evolution of Collective Attention Clusters in Twitter: Time Evolving Nonnegative Matrix Factorisation.

Authors:  Shota Saito; Yoshito Hirata; Kazutoshi Sasahara; Hideyuki Suzuki
Journal:  PLoS One       Date:  2015-09-29       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.