Literature DB >> 12825646

Acoustic analysis of pathological voices compressed with MPEG system.

Julio Gonzalez1, Teresa Cervera, M José Llau.   

Abstract

The MPEG-1 Layer 3 compression schema of audio signal, commonly known as mp3, has caused a great impact in recent years as it has reached high compression rates while conserving a high sound quality. Music and speech samples compressed at high bitrates are perceptually indistinguishable from the original samples, but very little was known about how compression acoustically affects the voice signal. A previous work with normal voices showed a high fidelity at high-bitrate compressions both in voice parameters and the amplitude-frequency spectrum. In the present work, dysphonic voices were tested through two studies. In the first study, spectrograms, long-term average spectra (LTAS), and fast Fourier transform (FFT) spectra of compressed and original samples of running speech were compared. In the second study, intensities, formant frequencies, formant bandwidths, and a multidimensional set of voice parameters were tested in a set of sustained phonations. Results showed that compression at high bitrates (96 and 128 kbps) preserved the relevant acoustic properties of the pathological voices. With compressions at lower bitrates, fidelity decreases, introducing some important alterations. Results from both works, Gonzalez and Cervera and this paper, open up the possibility of using MPEG-compression at high bitrates to store or transmit high-quality speech recordings, without altering their acoustic properties.

Entities:  

Mesh:

Year:  2003        PMID: 12825646     DOI: 10.1016/s0892-1997(03)00007-9

Source DB:  PubMed          Journal:  J Voice        ISSN: 0892-1997            Impact factor:   2.009


  3 in total

1.  Extreme compression for extreme conditions: pilot study to identify optimal compression of CT images using MPEG-4 video compression.

Authors:  P Gabriel Peterson; Sung K Pak; Binh Nguyen; Genevieve Jacobs; Les Folio
Journal:  J Digit Imaging       Date:  2012-12       Impact factor: 4.056

2.  Adaptations in humans for assessing physical strength from the voice.

Authors:  Aaron Sell; Gregory A Bryant; Leda Cosmides; John Tooby; Daniel Sznycer; Christopher von Rueden; Andre Krauss; Michael Gurven
Journal:  Proc Biol Sci       Date:  2010-06-16       Impact factor: 5.349

3.  Adaptive Multi-Rate Compression Effects on Vowel Analysis.

Authors:  David Ireland; Christina Knuepffer; Simon J McBride
Journal:  Front Bioeng Biotechnol       Date:  2015-08-20
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.