Literature DB >> 28660231

The Impact of Data Dependence on Speaker Recognition Evaluation.

Jin Chu Wu1, Alvin F Martin1, Craig S Greenberg1, Raghu N Kacker1.   

Abstract

The data dependency due to multiple use of the same subjects has impact on the standard error (SE) of the detection cost function (DCF) in speaker recognition evaluation. The DCF is defined as a weighted sum of the probabilities of type I and type II errors at a given threshold. A two-layer data structure is constructed: target scores are grouped into target sets based on the dependency, and likewise for non-target scores. On account of the needed equal probabilities for scores being selected when resampling, target sets must contain the same number of target scores, and so must non-target sets. In addition to the bootstrap method with i.i.d. assumption, the nonparametric two-sample one-layer and two-layer bootstrap methods are carried out based on whether the resampling takes place only on sets, or subsequently on scores within the sets. Due to the stochastic nature of the bootstrap, the distributions of the SEs of the DCF estimated using the three different bootstrap methods are created and compared. After performing hypothesis testing, it is found that data dependency increases not only the SE but also the variation of the SE, and the two-layer bootstrap is more conservative than the one-layer bootstrap. The rationale regarding the different impacts of the three bootstrap methods on the estimated SEs is investigated.

Entities:  

Keywords:  Data dependency; bootstrap; multinomial probability; resampling; speaker recognition; standard error

Year:  2016        PMID: 28660231      PMCID: PMC5484007          DOI: 10.1109/TASLP.2016.2614725

Source DB:  PubMed          Journal:  IEEE/ACM Trans Audio Speech Lang Process


  5 in total

1.  Comparison of three methods for estimating the standard error of the area under the curve in ROC analysis of quantitative data.

Authors:  Karim O Hajian-Tilaki; James A Hanley
Journal:  Acad Radiol       Date:  2002-11       Impact factor: 3.173

2.  Performance generalization in biometric authentication using joint user-specific and sample bootstraps.

Authors:  Norman Poh; Alvin Martin; Samy Bengio
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2007-03       Impact factor: 6.226

3.  Validation of Nonparametric Two-Sample Bootstrap in ROC Analysis on Large Datasets.

Authors:  Jin Chu Wu; Alvin F Martin; Raghu N Kacker
Journal:  Commun Stat Simul Comput       Date:  2015-08-31       Impact factor: 1.118

4.  Comparison of quantitative diagnostic tests: type I error, power, and sample size.

Authors:  K Linnet
Journal:  Stat Med       Date:  1987-03       Impact factor: 2.373

5.  Measures, Uncertainties, and Significance Test in Operational ROC Analysis.

Authors:  Jin Chu Wu; Alvin F Martin; Raghu N Kacker
Journal:  J Res Natl Inst Stand Technol       Date:  2011-02-01
  5 in total
  2 in total

1.  Monte Carlo studies of bootstrap variability in ROC analysis with data dependency.

Authors:  Jin Chu Wu; Alvin F Martin; Raghu N Kacker
Journal:  Commun Stat Simul Comput       Date:  2018       Impact factor: 1.118

2.  A novel measure and significance testing in data analysis of cell image segmentation.

Authors:  Jin Chu Wu; Michael Halter; Raghu N Kacker; John T Elliott; Anne L Plant
Journal:  BMC Bioinformatics       Date:  2017-03-14       Impact factor: 3.169

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.