Literature DB >> 1960278

Gender recognition from speech. Part I: Coarse analysis.

K Wu1, D G Childers.   

Abstract

The purpose of this research was to investigate the potential effectiveness of digital speech processing and pattern recognition techniques for the automatic recognition of gender from speech segments. In this paper "coarse" acoustic coefficients (autocorrelation, linear prediction, cepstrum, and reflection) were used to form test and reference templates for vowels, voiced fricatives, and unvoiced fricatives. The effects of different distance measures, filter orders, recognition schemes, and vowels and fricatives were comparatively assessed to determine their effectiveness for the task of gender recognition from speech segments. The results showed that most of the acoustic parameters worked well for gender recognition. A within-gender and within-subject averaging technique was important for generating appropriate test and reference templates. The Euclidean distance measure appeared to be the most robust as well as the simplest of the distance measures. The results from this study implied that the gender information is time invariant, phoneme independent, and speaker independent for a given gender. One recognition scheme achieved 100% correct speaker gender classification for a database of 52 talkers (27 male and 25 female). In part II of this paper [D.G. Childers and K. Wu, J. Acoust. Soc. Am. 90, 1841-1856 (1991); hereafter referred to as paper II] the detailed features of ten vowels that appeared responsible for distinguishing a speaker's gender were examined statistically. Included in paper II is a replication of part of the classical study of Peterson and Barney [J. Acoust. Soc. Am. 24, 175-184 (1952)] of vowel characteristics.

Entities:  

Mesh:

Year:  1991        PMID: 1960278     DOI: 10.1121/1.401663

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  10 in total

Review 1.  Voice processing in human and non-human primates.

Authors:  Pascal Belin
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2006-12-29       Impact factor: 6.237

2.  Developmental sexual dimorphism of the oral and pharyngeal portions of the vocal tract: an imaging study.

Authors:  Houri K Vorperian; Shubing Wang; E Michael Schimek; Reid B Durtschi; Ray D Kent; Lindell R Gentry; Moo K Chung
Journal:  J Speech Lang Hear Res       Date:  2010-11-24       Impact factor: 2.297

3.  Discrimination of speaker sex and size when glottal-pulse rate and vocal-tract length are controlled.

Authors:  David R R Smith; Thomas C Walters; Roy D Patterson
Journal:  J Acoust Soc Am       Date:  2007-12       Impact factor: 1.840

4.  The interaction of glottal-pulse rate and vocal-tract length in judgements of speaker size, sex, and age.

Authors:  David R R Smith; Roy D Patterson
Journal:  J Acoust Soc Am       Date:  2005-11       Impact factor: 1.840

5.  Cerebral processing of voice gender studied using a continuous carryover FMRI design.

Authors:  Ian Charest; Cyril Pernet; Marianne Latinus; Frances Crabbe; Pascal Belin
Journal:  Cereb Cortex       Date:  2012-04-05       Impact factor: 5.357

6.  The role of pitch and timbre in voice gender categorization.

Authors:  Cyril R Pernet; Pascal Belin
Journal:  Front Psychol       Date:  2012-02-03

7.  The Perception of Operational Sex Ratios by Voice.

Authors:  John G Neuhoff
Journal:  Sci Rep       Date:  2017-12-19       Impact factor: 4.379

8.  Measuring neuropsychiatric symptoms in patients with early cognitive decline using speech analysis.

Authors:  Alexandra König; Elisa Mallick; Johannes Tröger; Nicklas Linz; Radia Zeghari; Valeria Manera; Philippe Robert
Journal:  Eur Psychiatry       Date:  2021-10-13       Impact factor: 5.361

Review 9.  Hearing, touching, and multisensory integration during mate choice.

Authors:  Constanze Lenschow; Ana Rita P Mendes; Susana Q Lima
Journal:  Front Neural Circuits       Date:  2022-09-20       Impact factor: 3.342

10.  Cerebral activity to opposite-sex voices reflected by event-related potentials.

Authors:  Ya Li; Feng Gu; Xiliang Zhang; Lizhuang Yang; Lijun Chen; Zhengde Wei; Rujing Zha; Ying Wang; Xiaoming Li; Yifeng Zhou; Xiaochu Zhang
Journal:  PLoS One       Date:  2014-04-11       Impact factor: 3.240

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.