Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 The Impact of Data Dependence on Speaker Recognition Evaluation.

Literature DB >> 28660231

The Impact of Data Dependence on Speaker Recognition Evaluation.

Jin Chu Wu¹, Alvin F Martin¹, Craig S Greenberg¹, Raghu N Kacker¹.

Abstract

The data dependency due to multiple use of the same subjects has impact on the standard error (SE) of the detection cost function (DCF) in speaker recognition evaluation. The DCF is defined as a weighted sum of the probabilities of type I and type II errors at a given threshold. A two-layer data structure is constructed: target scores are grouped into target sets based on the dependency, and likewise for non-target scores. On account of the needed equal probabilities for scores being selected when resampling, target sets must contain the same number of target scores, and so must non-target sets. In addition to the bootstrap method with i.i.d. assumption, the nonparametric two-sample one-layer and two-layer bootstrap methods are carried out based on whether the resampling takes place only on sets, or subsequently on scores within the sets. Due to the stochastic nature of the bootstrap, the distributions of the SEs of the DCF estimated using the three different bootstrap methods are created and compared. After performing hypothesis testing, it is found that data dependency increases not only the SE but also the variation of the SE, and the two-layer bootstrap is more conservative than the one-layer bootstrap. The rationale regarding the different impacts of the three bootstrap methods on the estimated SEs is investigated.

Entities: Chemical Disease Gene Species

Keywords: Data dependency; bootstrap; multinomial probability; resampling; speaker recognition; standard error

Year: 2016 PMID： 28660231 PMCID： PMC5484007 DOI： 10.1109/TASLP.2016.2614725

Source DB: PubMed Journal: IEEE/ACM Trans Audio Speech Lang Process

5 in total

1. Comparison of three methods for estimating the standard error of the area under the curve in ROC analysis of quantitative data.

Authors: Karim O Hajian-Tilaki; James A Hanley
Journal: Acad Radiol Date: 2002-11 Impact factor: 3.173

2. Performance generalization in biometric authentication using joint user-specific and sample bootstraps.

Authors: Norman Poh; Alvin Martin; Samy Bengio
Journal: IEEE Trans Pattern Anal Mach Intell Date: 2007-03 Impact factor: 6.226

3. Validation of Nonparametric Two-Sample Bootstrap in ROC Analysis on Large Datasets.

Authors: Jin Chu Wu; Alvin F Martin; Raghu N Kacker
Journal: Commun Stat Simul Comput Date: 2015-08-31 Impact factor: 1.118

4. Comparison of quantitative diagnostic tests: type I error, power, and sample size.

Authors: K Linnet
Journal: Stat Med Date: 1987-03 Impact factor: 2.373

5. Measures, Uncertainties, and Significance Test in Operational ROC Analysis.

Authors: Jin Chu Wu; Alvin F Martin; Raghu N Kacker
Journal: J Res Natl Inst Stand Technol Date: 2011-02-01

5 in total

2 in total

1. Monte Carlo studies of bootstrap variability in ROC analysis with data dependency.

Authors: Jin Chu Wu; Alvin F Martin; Raghu N Kacker
Journal: Commun Stat Simul Comput Date: 2018 Impact factor: 1.118

2. A novel measure and significance testing in data analysis of cell image segmentation.

Authors: Jin Chu Wu; Michael Halter; Raghu N Kacker; John T Elliott; Anne L Plant
Journal: BMC Bioinformatics Date: 2017-03-14 Impact factor: 3.169

2 in total