
Exploring different attributes of source information for speaker verification with limited test data.

Rohan Kumar Das, S R Mahadeva Prasanna.

Abstract

This work explores mel power difference of spectrum in subband, residual mel frequency cepstral coefficient, and discrete cosine transform of the integrated linear prediction residual for speaker verification under limited test data conditions. These three source features are found to capture different attributes of source information, namely, periodicity, smoothed spectrum information, and shape of the glottal signal, respectively. On the NIST SRE 2003 database, the proposed combination of the three source features performs better [equal error rate (EER): 20.19%, decision cost function (DCF): 0.3759] than the mel frequency cepstral coefficient feature (EER: 22.31%, DCF: 0.4128) for 2 s duration of test segments.
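The abstract reports results as equal error rate (EER), the operating point at which the false acceptance rate and the false rejection rate coincide. The following is a minimal sketch of how an EER can be estimated from verification scores; it is not the authors' evaluation code, and the function name, the threshold sweep, and the toy scores are assumptions made for illustration only.

```python
import numpy as np

def equal_error_rate(genuine_scores, impostor_scores):
    """Estimate the EER from two score lists (hypothetical helper).

    Assumes higher scores mean "more likely the claimed speaker".
    """
    genuine = np.asarray(genuine_scores, dtype=float)
    impostor = np.asarray(impostor_scores, dtype=float)

    # Sweep every observed score as a candidate decision threshold.
    thresholds = np.unique(np.concatenate([genuine, impostor]))

    # False rejection: genuine trial scored below the threshold.
    frr = np.array([(genuine < t).mean() for t in thresholds])
    # False acceptance: impostor trial scored at or above the threshold.
    far = np.array([(impostor >= t).mean() for t in thresholds])

    # Take the threshold where the two error rates are closest
    # and report their average as the EER estimate.
    idx = np.argmin(np.abs(far - frr))
    return (far[idx] + frr[idx]) / 2.0

# Toy scores, invented for illustration (not data from the paper).
genuine = [0.9, 0.8, 0.7, 0.4]
impostor = [0.6, 0.3, 0.2, 0.1]
eer = equal_error_rate(genuine, impostor)
print(f"EER: {100 * eer:.2f}%")
```

With real trial lists the two error curves rarely cross exactly at an observed score, so evaluation toolkits usually interpolate between neighbouring thresholds; the nearest-point average used here is a simplification.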


Year: 2016; PMID: 27475144; DOI: 10.1121/1.4954653

Source DB: PubMed; Journal: J Acoust Soc Am; ISSN: 0001-4966; Impact factor: 1.840


  1 in total

1.  Towards understanding speaker discrimination abilities in humans and machines for text-independent short utterances of different speech styles.

Authors: Soo Jin Park; Gary Yeung; Neda Vesselinova; Jody Kreiman; Patricia A Keating; Abeer Alwan
Journal: J Acoust Soc Am; Date: 2018-07; Impact factor: 1.840

