Temporal Segmentation for Laryngeal High-Speed Videoendoscopy in Connected Speech.

Maryam Naghibolhosseini, Dimitar D Deliyski, Stephanie R C Zacharias, Alessandro de Alarcon, Robert F Orlikoff.

Abstract

OBJECTIVE: This study proposes a gradient-based method for temporal segmentation of laryngeal high-speed videoendoscopy (HSV) data obtained during connected speech.
METHODS: A custom-developed HSV system coupled with a flexible fiberoptic nasolaryngoscope was used to record one vocally normal female participant during reading of the "Rainbow Passage." A gradient-based algorithm was developed to generate a motion window. When applied to the HSV data, the motion window acted as a filter tracking the location of the vibrating vocal folds. The glottal area waveform was estimated using a statistical-based image-processing approach. The vocal fold vibratory frequency was computed by an autocorrelation-based extraction of the fundamental frequency (f0) from the glottal area waveform. Temporal segmentation was then performed based on the f0 contour and automatic detection of the epiglottic obstructions. Additionally, visual temporal segmentation was performed by viewing the HSV images frame by frame to determine the time points of the vocalization onsets and offsets, and the epiglottic obstructions of the glottis.
RESULTS: The time points resulting from the automatic and visual temporal segmentation methods were cross-validated. The f0-contour patterns of rise and fall resulting from the automatic algorithm were found to be in agreement with the visual inspection of the vibratory frequency change in the HSV data.
CONCLUSIONS: This study demonstrated the feasibility of automatic temporal segmentation of HSV imaging of connected speech, which allows for mapping the video content into onsets, offsets, and epiglottic obstructions for each vocalization. Automated analysis of HSV imaging of connected speech has significant clinical potential for advancing instrumental voice assessment protocols.
Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
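
The autocorrelation-based f0 extraction and the f0-contour segmentation described under METHODS can be illustrated with a minimal sketch. It is not the authors' implementation: the 4000 frames-per-second rate, the 200-frame window, the 50-frame hop, the 60-600 Hz search range, the 0.5 peak threshold, and the function names estimate_f0, f0_contour, and segment_vocalizations are all assumptions made for illustration, and the glottal area waveform here is synthetic. Detection of epiglottic obstructions is not covered because the abstract does not describe how it is done.

```python
import math


def estimate_f0(window, frame_rate, f0_min=60.0, f0_max=600.0, min_peak=0.5):
    """Estimate f0 (Hz) of one glottal-area-waveform window; 0.0 means unvoiced.

    The search range and the peak threshold are illustrative assumptions.
    """
    n = len(window)
    mean = sum(window) / n
    x = [v - mean for v in window]          # remove the DC component
    energy = sum(v * v for v in x)
    if energy == 0.0:                       # flat window: no vibration
        return 0.0
    lag_min = max(1, int(frame_rate / f0_max))
    lag_max = min(n - 1, int(frame_rate / f0_min))
    best_lag, best_r = 0, -1.0
    for lag in range(lag_min, lag_max + 1):
        # Autocorrelation at this lag, normalized by the zero-lag energy.
        r = sum(x[i] * x[i + lag] for i in range(n - lag)) / energy
        if r > best_r:
            best_lag, best_r = lag, r
    if best_lag == 0 or best_r < min_peak:
        return 0.0
    return frame_rate / best_lag


def f0_contour(gaw, frame_rate, win=200, hop=50):
    """Slide a window over the waveform; return window-center times and f0."""
    times, f0 = [], []
    for start in range(0, len(gaw) - win + 1, hop):
        times.append((start + win / 2) / frame_rate)
        f0.append(estimate_f0(gaw[start:start + win], frame_rate))
    return times, f0


def segment_vocalizations(times, f0):
    """Group consecutive voiced windows into (onset, offset) time pairs."""
    segments, onset, last_voiced = [], None, None
    for t, f in zip(times, f0):
        if f > 0:
            if onset is None:
                onset = t
            last_voiced = t
        elif onset is not None:
            segments.append((onset, last_voiced))
            onset = None
    if onset is not None:
        segments.append((onset, last_voiced))
    return segments


# Synthetic example: silence, then 0.5 s of 200 Hz vibration, then silence.
FRAME_RATE = 4000  # assumed frames per second, not taken from the abstract
gaw = [0.0] * 4000
for n in range(1000, 3000):
    # Half-rectified sinusoid as a crude stand-in for glottal opening area.
    gaw[n] = max(0.0, math.sin(2 * math.pi * 200 * n / FRAME_RATE))

times, f0 = f0_contour(gaw, FRAME_RATE)
segments = segment_vocalizations(times, f0)
print(segments)
```

Onset and offset times in this sketch are resolved only to the hop size, since each f0 value is assigned to the center of its window. A shorter hop gives finer time resolution at a proportional increase in computation.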

Keywords:  Connected speech; High-speed videoendoscopy; Laryngeal imaging; Voice assessment

Year: 2017; PMID: 28647431; PMCID: PMC5740029; DOI: 10.1016/j.jvoice.2017.05.014

Source DB: PubMed; Journal: J Voice; ISSN: 0892-1997; Impact factor: 2.009


