
A Bayesian inference model for speech localization (L).

José Escolano, José M Perez-Lorenzo, Ning Xiang, Máximo Cobos, José J López.

Abstract

The localization of active speakers with microphone arrays is an active research area of considerable interest in many fields of acoustics. Many source localization algorithms are based on computing the Generalized Cross-Correlation function between microphone pairs with phase transform weighting. Unfortunately, the performance of these methods is severely reduced when wall reflections and multiple sound sources are present in the acoustic environment. As a result, estimating the number of active sound sources and their actual directions becomes a challenging task. To tackle this problem, a Bayesian inference framework is proposed. Based on a nested sampling algorithm, a mixture model and its parameters are estimated, indicating both the number of sources (model selection) and their angles of arrival (parameter estimation). A set of measured data demonstrates the accuracy of the proposed model.
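
For readers unfamiliar with the pairwise step the abstract refers to, the following is a minimal sketch of delay estimation from the Generalized Cross-Correlation with phase transform weighting for a single microphone pair. It assumes NumPy; the function name `gcc_phat` and its parameters are illustrative and do not come from the paper. It covers only the correlation baseline, not the Bayesian mixture model or the nested sampling step that the authors propose.

```python
import numpy as np


def gcc_phat(sig, ref, fs, max_tau=None):
    """Estimate the delay (in seconds) of `sig` relative to `ref`.

    Illustrative helper, not the paper's implementation.
    A positive result means `sig` lags `ref`.
    """
    # Zero-pad to the combined length so the correlation is linear, not circular.
    n = len(sig) + len(ref)
    sig_f = np.fft.rfft(sig, n=n)
    ref_f = np.fft.rfft(ref, n=n)

    # Cross-power spectrum, normalized to unit magnitude (phase transform).
    cross = sig_f * np.conj(ref_f)
    cross /= np.abs(cross) + 1e-12  # small constant avoids division by zero

    # Back to the lag domain.
    cc = np.fft.irfft(cross, n=n)

    # Restrict the search to physically plausible lags if max_tau is given.
    max_shift = n // 2
    if max_tau is not None:
        max_shift = max(1, min(int(fs * max_tau), max_shift))

    # Reorder so that index `max_shift` corresponds to zero lag.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    shift = int(np.argmax(np.abs(cc))) - max_shift
    return shift / fs


# Usage: white noise delayed by 5 samples at a 16 kHz sampling rate.
rng = np.random.default_rng(0)
reference = rng.standard_normal(1024)
delayed = np.concatenate((np.zeros(5), reference[:-5]))
tau = gcc_phat(delayed, reference, fs=16000)
```

With a known microphone spacing, such a delay can be converted to an angle of arrival. With reflections or several simultaneous sources the correlation shows multiple competing peaks, which is the difficulty the abstract describes.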


Year:  2012        PMID: 22978853     DOI: 10.1121/1.4740489

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  1 in total

1.  Audio-visual perception system for a humanoid robotic head.

Authors:  Raquel Viciana-Abad; Rebeca Marfil; Jose M Perez-Lorenzo; Juan P Bandera; Adrian Romero-Garces; Pedro Reche-Lopez
Journal:  Sensors (Basel)       Date:  2014-05-28       Impact factor: 3.576

