Literature DB >> 26597002

A generalized procedure for analyzing sustained and dynamic vocal fold vibrations from laryngeal high-speed videos using phonovibrograms.

Jakob Unger1, Maria Schuster2, Dietmar J Hecker3, Bernhard Schick3, Jörg Lohscheller4.   

Abstract

OBJECTIVE: This work presents a computer-based approach to analyze the two-dimensional vocal fold dynamics of endoscopic high-speed videos, and constitutes an extension and generalization of a previously proposed wavelet-based procedure. While most approaches aim for analyzing sustained phonation conditions, the proposed method allows for a clinically adequate analysis of both dynamic as well as sustained phonation paradigms.
MATERIALS AND METHODS: The analysis procedure is based on a spatio-temporal visualization technique, the phonovibrogram, that facilitates the documentation of the visible laryngeal dynamics. From the phonovibrogram, a low-dimensional set of features is computed using a principle component analysis strategy that quantifies the type of vibration patterns, irregularity, lateral symmetry and synchronicity, as a function of time. Two different test bench data sets are used to validate the approach: (I) 150 healthy and pathologic subjects examined during sustained phonation. (II) 20 healthy and pathologic subjects that were examined twice: during sustained phonation and a glissando from a low to a higher fundamental frequency. In order to assess the discriminative power of the extracted features, a Support Vector Machine is trained to distinguish between physiologic and pathologic vibrations. The results for sustained phonation sequences are compared to the previous approach. Finally, the classification performance of the stationary analyzing procedure is compared to the transient analysis of the glissando maneuver.
RESULTS: For the first test bench the proposed procedure outperformed the previous approach (proposed feature set: accuracy: 91.3%, sensitivity: 80%, specificity: 97%, previous approach: accuracy: 89.3%, sensitivity: 76%, specificity: 96%). Comparing the classification performance of the second test bench further corroborates that analyzing transient paradigms provides clear additional diagnostic value (glissando maneuver: accuracy: 90%, sensitivity: 100%, specificity: 80%, sustained phonation: accuracy: 75%, sensitivity: 80%, specificity: 70%).
CONCLUSIONS: The incorporation of parameters describing the temporal evolvement of vocal fold vibration clearly improves the automatic identification of pathologic vibration patterns. Furthermore, incorporating a dynamic phonation paradigm provides additional valuable information about the underlying laryngeal dynamics that cannot be derived from sustained conditions. The proposed generalized approach provides a better overall classification performance than the previous approach, and hence constitutes a new advantageous tool for an improved clinical diagnosis of voice disorders.
Copyright © 2015 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Dynamic phonation; High-speed laryngoscopy; Multiscale product; Voice disorder; Wavelet ridge; Wavelet-based analysis

Mesh:

Year:  2015        PMID: 26597002     DOI: 10.1016/j.artmed.2015.10.002

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  6 in total

1.  Fully automatic segmentation of glottis and vocal folds in endoscopic laryngeal high-speed videos using a deep Convolutional LSTM Network.

Authors:  Mona Kirstin Fehling; Fabian Grosch; Maria Elke Schuster; Bernhard Schick; Jörg Lohscheller
Journal:  PLoS One       Date:  2020-02-10       Impact factor: 3.240

2.  Biomechanical simulation of vocal fold dynamics in adults based on laryngeal high-speed videoendoscopy.

Authors:  Michael Döllinger; Pablo Gómez; Rita R Patel; Christoph Alexiou; Christopher Bohr; Anne Schützenberger
Journal:  PLoS One       Date:  2017-11-09       Impact factor: 3.240

3.  Machine learning based identification of relevant parameters for functional voice disorders derived from endoscopic high-speed recordings.

Authors:  Patrick Schlegel; Stefan Kniesburges; Stephan Dürr; Anne Schützenberger; Michael Döllinger
Journal:  Sci Rep       Date:  2020-06-29       Impact factor: 4.379

4.  Influence of spatial camera resolution in high-speed videoendoscopy on laryngeal parameters.

Authors:  Patrick Schlegel; Melda Kunduk; Michael Stingl; Marion Semmler; Michael Döllinger; Christopher Bohr; Anne Schützenberger
Journal:  PLoS One       Date:  2019-04-22       Impact factor: 3.240

5.  Interdependencies between acoustic and high-speed videoendoscopy parameters.

Authors:  Patrick Schlegel; Andreas M Kist; Melda Kunduk; Stephan Dürr; Michael Döllinger; Anne Schützenberger
Journal:  PLoS One       Date:  2021-02-02       Impact factor: 3.240

6.  Fluid-structure-acoustic interactions in an ex vivo porcine phonation model.

Authors:  Marion Semmler; David A Berry; Anne Schützenberger; Michael Döllinger
Journal:  J Acoust Soc Am       Date:  2021-03       Impact factor: 1.840

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.