Acoustic Denoising using Dictionary Learning with Spectral and Temporal Regularization.

Colin Vaz, Vikram Ramanarayanan, Shrikanth Narayanan.

Abstract

We present a method for speech enhancement of data collected in extremely noisy environments, such as recordings made during magnetic resonance imaging (MRI) scans. We propose an algorithm based on dictionary learning to perform this enhancement. We use complex nonnegative matrix factorization with intra-source additivity (CMF-WISA) to learn dictionaries of the noise and speech+noise portions of the data and use these to factor the noisy spectrum into estimated speech and noise components. We augment the CMF-WISA cost function with spectral and temporal regularization terms to improve the noise modeling. Based on both objective and subjective assessments, we find that our algorithm significantly outperforms traditional techniques such as Least Mean Squares (LMS) filtering, while not requiring prior knowledge or specific assumptions, such as periodicity of the noise waveforms, that current state-of-the-art algorithms require.
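
The dictionary-based separation described in the abstract can be illustrated with a much simpler stand-in. The sketch below is not CMF-WISA and includes none of the spectral or temporal regularization terms. It assumes ordinary nonnegative matrix factorization on magnitude spectrograms with a Euclidean cost and multiplicative updates. The function names `learn_dictionary` and `separate`, the ranks, and the iteration counts are hypothetical choices made for illustration only. A noise dictionary is learned from a noise-only segment, held fixed while speech atoms are learned on the speech+noise segment, and a soft mask then splits the noisy spectrum into speech and noise estimates.

```python
import numpy as np

EPS = 1e-9  # guards against division by zero in the multiplicative updates


def learn_dictionary(V, rank, n_iter=200, seed=0):
    """Factor a nonnegative spectrogram V (freq x time) as W @ H.

    Plain Euclidean NMF with multiplicative updates; an assumed
    stand-in for the paper's CMF-WISA dictionary learning.
    """
    rng = np.random.default_rng(seed)
    W = rng.random((V.shape[0], rank)) + EPS
    H = rng.random((rank, V.shape[1])) + EPS
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + EPS)
        W *= (V @ H.T) / (W @ H @ H.T + EPS)
    return W, H


def separate(V, W_noise, speech_rank, n_iter=200, seed=1):
    """Split noisy spectrogram V into speech and noise estimates.

    The noise atoms in W_noise stay fixed; `speech_rank` extra atoms
    are learned from V itself (semi-supervised NMF).
    """
    rng = np.random.default_rng(seed)
    n_noise = W_noise.shape[1]
    W = np.hstack([W_noise, rng.random((V.shape[0], speech_rank)) + EPS])
    H = rng.random((W.shape[1], V.shape[1])) + EPS
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + EPS)
        W_new = W * (V @ H.T) / (W @ H @ H.T + EPS)
        W[:, n_noise:] = W_new[:, n_noise:]  # update speech atoms only
    noise_part = W[:, :n_noise] @ H[:n_noise]
    speech_part = W[:, n_noise:] @ H[n_noise:]
    # Soft (Wiener-like) mask so the two estimates sum back to V.
    mask = speech_part / (speech_part + noise_part + EPS)
    return mask * V, (1.0 - mask) * V


if __name__ == "__main__":
    rng = np.random.default_rng(42)
    # Synthetic stand-ins for magnitude spectrograms (16 bins).
    V_noise = rng.random((16, 3)) @ rng.random((3, 40))
    V_mix = V_noise[:, :30] + rng.random((16, 2)) @ rng.random((2, 30))
    W_n, _ = learn_dictionary(V_noise, rank=3)
    speech_est, noise_est = separate(V_mix, W_n, speech_rank=2)
    print(speech_est.shape, noise_est.shape)
```

In practice the inputs would be short-time Fourier transform magnitudes of the recording, and the masked magnitude would be recombined with the noisy phase before inversion. The paper's complex factorization is instead formulated on the complex spectrum, which this sketch does not attempt to reproduce.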

Keywords:  complex NMF; dictionary learning; noise suppression; real-time MRI

Year:  2018        PMID: 30271810      PMCID: PMC6157637          DOI: 10.1109/TASLP.2018.2800280

Source DB:  PubMed          Journal:  IEEE/ACM Trans Audio Speech Lang Process


References: 11 in total (first 10 listed)

1.  Electromagnetic articulography treatment for an adult with Broca's aphasia and apraxia of speech.

Authors:  W F Katz; S V Bharadwaj; B Carstens
Journal:  J Speech Lang Hear Res       Date:  1999-12       Impact factor: 2.297

Review 2.  Auditory noise associated with MR procedures: a review.

Authors:  M McJury; F G Shellock
Journal:  J Magn Reson Imaging       Date:  2000-07       Impact factor: 4.813

3.  An approach to real-time magnetic resonance imaging for speech production.

Authors:  Shrikanth Narayanan; Krishna Nayak; Sungbok Lee; Abhinav Sethy; Dani Byrd
Journal:  J Acoust Soc Am       Date:  2004-04       Impact factor: 1.840

4.  Synchronized and noise-robust audio recordings during realtime magnetic resonance imaging scans.

Authors:  Erik Bresch; Jon Nielsen; Krishna Nayak; Shrikanth Narayanan
Journal:  J Acoust Soc Am       Date:  2006-10       Impact factor: 1.840

5.  Nonnegative matrix factorization with the Itakura-Saito divergence: with application to music analysis.

Authors:  Cédric Févotte; Nancy Bertin; Jean-Louis Durrieu
Journal:  Neural Comput       Date:  2009-03       Impact factor: 2.026

6.  Objective and subjective evaluation of adaptive speech enhancement methods for functional MRI.

Authors:  Venkat R Ramachandran; Issa M S Panahi; Ali A Milani
Journal:  J Magn Reson Imaging       Date:  2010-01       Impact factor: 4.813

7.  Towards undistorted and noise-free speech in an MRI scanner: correlation subtraction followed by spectral noise gating.

Authors:  Joshua M Inouye; Silvia S Blemker; David I Inouye
Journal:  J Acoust Soc Am       Date:  2014-03       Impact factor: 1.840

8.  Abnormal articulatory dynamics in a patient with apraxia of speech: x-ray microbeam observation.

Authors:  M Itoh; S Sasanuma; H Hirose; H Yoshioka; T Ushijima
Journal:  Brain Lang       Date:  1980-09       Impact factor: 2.381

9.  Flexible retrospective selection of temporal resolution in real-time speech MRI using a golden-ratio spiral view order.

Authors:  Yoon-Chul Kim; Shrikanth S Narayanan; Krishna S Nayak
Journal:  Magn Reson Med       Date:  2010-12-16       Impact factor: 4.668

10.  Timing effects of syllable structure and stress on nasals: a real-time MRI examination.

Authors:  Dani Byrd; Stephen Tobin; Erik Bresch; Shrikanth Narayanan
Journal:  J Phon       Date:  2009-01-01

Cited by: 4 in total

1.  3D dynamic MRI of the vocal tract during natural speech.

Authors:  Yongwan Lim; Yinghua Zhu; Sajan Goud Lingala; Dani Byrd; Shrikanth Narayanan; Krishna Shrinivas Nayak
Journal:  Magn Reson Med       Date:  2018-11-03       Impact factor: 4.668

2.  A modular architecture for articulatory synthesis from gestural specification.

Authors:  Rachel Alexander; Tanner Sorensen; Asterios Toutios; Shrikanth Narayanan
Journal:  J Acoust Soc Am       Date:  2019-12       Impact factor: 1.840

3.  4D magnetic resonance imaging atlas construction using temporally aligned audio waveforms in speech.

Authors:  Fangxu Xing; Riwei Jin; Imani R Gilbert; Jamie L Perry; Bradley P Sutton; Xiaofeng Liu; Georges El Fakhri; Ryan K Shosted; Jonghye Woo
Journal:  J Acoust Soc Am       Date:  2021-11       Impact factor: 1.840

4.  Variation in compensatory strategies as a function of target constriction degree in post-glossectomy speech.

Authors:  Christina Hagedorn; Yijing Lu; Asterios Toutios; Uttam Sinha; Louis Goldstein; Shrikanth Narayanan
Journal:  JASA Express Lett       Date:  2022-04-22