Literature DB >> 34227721

Automatic boost articulation therapy in adults with dysarthria: Acceptability, usability and user interaction.

Viviana Mendoza Ramos1,2, Juan C Vasquez-Correa3, Rani Cremers4, Leen Van Den Steen1, Elmar Nöth3, Marc De Bodt1,2,4, Gwen Van Nuffelen1,2,4.   

Abstract

BACKGROUND: Imprecise articulation has a negative impact on speech intelligibility. Therefore, treatment of articulation is clinically relevant in patients with dysarthria. In order to be effective and according to the principles of motor learning, articulation therapy needs to be intensive, well organized, with adequate feedback and requires frequent practice. AIMS: The aims of this pilot study are (1) to evaluate the feasibility of a virtual articulation therapy (VAT) to guide patients with dysarthria through a boost articulation therapy (BArT) program; (2) to evaluate the acoustic models' performance used for automatic phonological error detection; and (3) to validate the system by end-users from their perspective. METHODS & PROCEDURES: The VAT provides an extensive and well-structured package of exercises with visual and auditory modelling and adequate feedback on the utterances. The tool incorporates automated methods to detect phonological errors, which are specifically designed to analyse Dutch speech production. A total of 14 subjects with dysarthria evaluated the acceptability, usability and user interaction with the VAT based on two completed therapy sessions using a self-designed questionnaire. OUTCOMES &
RESULTS: In general, participants were positive about the new computer-based therapy approach. The algorithm performance for phonological error detection shows it to be accurate, which contributes to adequate feedback of utterance production. The results of the study indicate that the VAT has a user-friendly interface that can be used independently by patients with dysarthria who have sufficient cognitive, linguistic, motoric and sensory skills to benefit from speech therapy. Recommendations were given by the end-users to further optimize the program and to ensure user engagement. CONCLUSIONS & IMPLICATIONS: The initial implementation of an automatic BArT shows it to be feasible and well accepted by end-users. The tool is an appropriate solution to increase the frequency and intensity of articulation training that supports traditional methods. WHAT THIS PAPER ADDS: What is already known on the subject Behavioural interventions to improve articulation in patients with dysarthria demand intensive treatments, repetitive practice and feedback. However, the current treatments are mainly limited in time to the interactive sessions in the presence of speech-language pathology. Automatic systems addressing the needs of individuals with dysarthria are scarce. This study evaluates the feasibility of a VAT program and investigates its acceptability, usability and user interaction. What this paper adds to existing knowledge The computer-based speech therapy approach developed and applied in this study intends to support intensive articulation training of patients with dysarthria. The virtual speech therapy offers the possibility of an individualized and customized therapy programme, with an extensive database of exercises, visual and auditory models of the target utterances, and providing adequate feedback based on automatic acoustic analysis of speech. What are the potential or actual clinical implications of this work? The automatic BArT overcomes the limitation in time of face-to-face traditional speech therapy. It offers patients the opportunity to have access to speech therapy more intensively and frequently in their home environment.
© 2021 The Authors. International Journal of Language & Communication Disorders published by John Wiley & Sons Ltd on behalf of Royal College of Speech and Language Therapists.

Entities:  

Mesh:

Year:  2021        PMID: 34227721      PMCID: PMC9546165          DOI: 10.1111/1460-6984.12647

Source DB:  PubMed          Journal:  Int J Lang Commun Disord        ISSN: 1368-2822            Impact factor:   2.909


INTRODUCTION

Dysarthria is a neurogenic motor speech disorder caused by impaired muscular control due to damage to the central or peripheral nervous system (Darley et al. 1969, 1975). The impairment may affect the mechanisms involved in speech production and consequently cause deficits in speech intelligibility. Imprecise speech sound articulation is a hallmark of dysarthria (Tjaden 2007, Weismer 2007). Research in this domain has shown that articulation is the most essential dimension of speech contributing to the overall speech intelligibility in dysarthria (Kent 1992, De Bodt et al. 2002). Deficient articulatory movements are linked with reduced acoustic contrast for consonants and vowels, which leads to reduced intelligibility (Tjaden 2007, Tykalova et al. 2017). Even when neurologic bases differ among different types of dysarthrias, the impact on articulatory movements and acoustic outcomes is in general comparable (Tjaden 2007). Reduced speech intelligibility has a major psychosocial impact. Daily life communication gets less evident, satisfactory or even impossible for subjects with dysarthria, affecting general quality of life (Dykstra et al. 2007, Dickson et al. 2008). Therefore, behavioural interventions to improve articulation and, consequently, speech intelligibility are necessary for individuals with dysarthria. There is scientific and clinical evidence that improving articulatory skills demands intensive and frequent therapy (Duffy 2019, Rosenbek and Jones 2009). There is growing evidence that the content of speech therapy must meet the principles of motor learning (Duffy 2019). Key elements of motor learning in clinical neurorehabilitation are intensive treatment, repetitive practice and sensory feedback (Kleim et al. 2003, Garvey et al. 2007, Kleim and Jones 2008, Maas et al. 2008). Extensive research in the speech therapy domain suggests that providing large amounts of practice over a shorter period can lead to better results/outcomes for adults with communication disorders (Bhogal et al. 2003, Fox et al. 2006). A more recent study demonstrated that boost articulation therapy (BArT), in which participants perform intensive articulatory drill and minimal pairs exercises for 5 consecutive days, has a significant positive effect on speech intelligibility in patients with dysarthria (Mendoza et al. 2021). However, the current treatments of articulation are mainly limited in time to the interactive sessions in the presence of speech–language pathology (SLP). A computer‐based speech therapy approach would overcome that limitation offering patients the opportunity to practise more intensively and frequently to reduce the number of fatiguing and time‐consuming transfers to the rehabilitation centre and to have access to speech therapy in their home environment. Virtual speech–language therapy would also provide patients with the possibility of an individualized and customized therapy programme. However, this virtual therapy program only targets those persons with dysarthria who have sufficient cognitive, linguistic, motoric and sensory (auditory and visual) skills to benefit from speech therapy (Weismer 2007). Virtual therapy technologies are not entirely new; however, only a few attempts have been made for adults with motor speech disorders (Lee 2019). Most of the reported studies using virtual therapy focus on adults with aphasia (e.g., Abad et al. 2013, Thompson et al. 2010, Van Vuuren and Cherney 2014, Cherney et al. 2008, Lee et al. 2009, Kalinyak‐Fliszar et al. 2015, Marshall et al. 2016, Amaya et al. 2018) and children with hearing impairment (Chaisanit et al. 2010, Eriksson et al. 2005, Massaro and Light 2004, Silva et al. 2012). Automatic systems addressing the needs of individuals with dysarthria are scarce (Lee 2019). The designed and tested experimental models include the single case study of Cole et al. (2007), which investigated the effects of Lee Silverman Voice Treatment (LSVT) with a virtual therapy system; qualitative evaluation of the system by the user shows positive results. The study of Palmer et al. (2007) compared the effects of computerized and traditional therapy for speakers with longstanding dysarthria. The computer‐based program consists of games that target the accurate production of individual speech sounds and exercises for practising sounds in words, phonation, pitch and volume, allowing independent training with feedback. This approach was demonstrated to be as effective as traditional therapy improving the speech of that specific group. The study of Saz et al. (2009), which evaluated a semi‐automated tool that includes different games for children and young adults, considers phonation, phonetic articulation and language understanding for Spanish speakers. The feedback given about the system by the therapist and end‐users was mainly positive. The study of Orozco‐Arroyave et al. (2020) introduced a mobile application for monitoring and evaluation of patients with Parkinson's disease. The app considers different motor aspects such as speech production, limb movements and finger tapping. It allows patients to track their progression and can be used as a personal health assistant. To our knowledge, there is no publicly available virtual articulation tool providing BArT with adequate feedback, especially not for Dutch‐speaking subjects with articulatory deficits. The present pilot study aims to evaluate the feasibility of a VAT for individuals with dysarthria. The goal of the system is to guide the patient through an intensive treatment programme based on the principles of motor learning and relying on speech technology for adequate feedback. It is intended to improve articulation and consequently pronunciation, intelligibility and communication abilities. The study also focuses on the evaluation of the system by end‐users. The purpose is to investigate the acceptability, usability and user interaction with computer‐assisted articulation therapy. The stakeholder's opinion is particularly valuable to ensure that the new virtual intervention provides a suitable and engaging treatment. Some important aspects to consider are the participants’ ability to interact independently with the virtual articulation trainer; whether they find the user interface attractive, supportive and clear, and if they find the material to be useful. It is also important to know how end‐users experience the visual feedback and their general perception of the program. Therefore, the main goals of this study are the design and development of a new computer‐based therapy program that supports intensive articulation training of patients with dysarthria, the evaluation of acoustic models’ performance used for automatic phonological error detection, and a general validation by stakeholders of the usability, acceptability and interaction with the automatic system.

MATERIALS AND METHODS

Participants

The developed program targets patients with dysarthria who have sufficient cognitive, motor and sensory (auditory and visual) skills to work with a computer and/or those who can rely on assistance. In order to participate in the study, the patients had to be adults (> 18 years of age) with a diagnosis of dysarthria due to neurogenic origin (confirmed by an experienced SLP and neurologist), native speakers of Dutch and with a score < 90% on the Dutch phoneme Intelligibility Assessment (DIA) (De Bodt et al. 2006). Participants were excluded from the study if they had serious visual or auditory problems, insufficient computer skills to use the software program independently and if an intensive articulation therapy was followed less than 2 months before the study in order to exclude possible effects of the training in the baseline of the intelligibility assessment. Patients were recruited by an experienced SLP at the Noorderhart Rehabilitation & Multiple Sclerosis Center Pelt in Hasselt by convenience sampling. A total of 14 subjects with dysarthria participated in this pilot study; their severity level was determined perceptually by the SLP. Table 1 displays the subjects’ characteristics. The Speech Handicap Index (SHI) (Van den Steen et al. 2011) was administrated to each participant; this instrument is a self‐evaluation of quality of life and quantifies the biopsychosocial impact of the speech disorder.
TABLE 1

Characteristics of the participants

ParticipantSexAge (years)Neurological pathologyType of dysarthriaSeverity dysarthriaDIA score (%)SHI score
1M59MSFlaccidModerate80%60
2M76StrokeFlaccidModerate82%18
3F76Parkinson's diseaseHypokineticMild90%32
4M58MSMixedMild68%20
5M70StrokeFlaccidModerate60%39
6M67StrokeFlaccidMild90%10
7M47Friedreich ataxiaAtaxicMild82%17
8M57MSFlaccidModerate62%41
9M29TBIFlaccidMild84%13
10F76Parkinson's diseaseHypokineticMild88%38
11F67StrokeFlaccidMild76%25
12F92StrokeAtaxicMild88%51
13F66MSSpasticMild90%23
14M67StrokeSpasticModerate76%1

Notes: M, male; F, female; MS, multiple sclerosis; TBI, traumatic brain injury; DIA, Dutch phoneme Intelligibility Assessment.

Score of the Speech Handicap Index (SHI) (score range: 0–60; score < 14 no impact, 14–22 light, 23–31 moderate, > 31 severe impact).

Characteristics of the participants Notes: M, male; F, female; MS, multiple sclerosis; TBI, traumatic brain injury; DIA, Dutch phoneme Intelligibility Assessment. Score of the Speech Handicap Index (SHI) (score range: 0–60; score < 14 no impact, 14–22 light, 23–31 moderate, > 31 severe impact).

Ethics permission

Ethical approval for this study was obtained by the Regional Committee of Medical and Health Research Ethics (B300201837099). All participants agreed voluntarily to participate in the study and signed an informed consent form.

Software

Design criteria

The VAT was designed for patients with articulation deficits in which a virtual therapist, acting like a real SLP, guides the patient through an intensive treatment program to improve articulation and consequently speech intelligibility. The developed software runs as a desktop application and can be installed on any computer. The VAT has multiple roles and functions: motivating and stimulating the patient, giving clear instructions, analysing the target utterance, and providing adequate feedback based on automatic acoustic analysis of speech. The development of the VAT system consisted of three main steps: (1) the development of an extensive database of exercises for articulation training; (2) the implementation of an algorithm for phonological error detection in order to provide the necessary feedback to the patients; and (3) the embedding of the set of new developments in an attractive, patient‐friendly user interface.

Extensive database of exercises and intervention design

The integrity of speech sounds is a major factor in determining intelligibility (Kent 1992), thus the direct treatment of articulation at the segmental level of speech is the aim of this therapy program. This approach intends to reduce the speech impairment, and its effects may be also generalized to connected speech. A strong correlation between word intelligibility and sentence intelligibility was also found in previous studies (Yorkston and Beukelman 1978, Weismer et al. 2001). The content of speech material consists of two different: intensive training programs to improve articulation and phoneme intelligibility in persons with dysarthria. Based on the principles of motor learning, the training programs comprise articulatory drill and minimal pairs. There are four levels of difficulty in each program, structured per target position of the consonant, in initial, medial and final position in the words. This allows the patient to go through intensive therapy. In a previous study, this particular approach (figure 1) resulted in significant improvements of intelligibility at different levels of speech in a group of adults with dysarthria (Mendoza et al. 2021). The extensive, well‐structured and hierarchical practice package of exercises recommended in that previous study is implemented in the VAT.
FIGURE 1

The intervention design (Mendoza et al. 2021). The components labelled (1) are exclusively intended for the intervention articulatory drill. The components labelled (2) are intended for the intervention of minimal pairs. When nothing is included, the parts are common for both interventions. *SLP scores each word by means of a scoring form; and **feedback.

The intervention design (Mendoza et al. 2021). The components labelled (1) are exclusively intended for the intervention articulatory drill. The components labelled (2) are intended for the intervention of minimal pairs. When nothing is included, the parts are common for both interventions. *SLP scores each word by means of a scoring form; and **feedback. The exercises only include a series of mono‐ and bisyllabic words. These types of words are usually not problematic to understand or to read for patients with mild cognitive impairments. Most of the selected words are meaningful and emphasize the correct pronunciation of the target phoneme, making the therapy more effective and goal oriented. The stimuli were carefully selected by experienced SLPs from a traditional articulation programme ‘Articulation in Practice’ (Huybrechts et al. 1999) and from the official word inventory of Dutch (Renkema 1995) to increase the number of stimuli.

Algorithm selection and modifications

The algorithm used in the program for phonological error detection derived from the acoustic signal is based on existing models (Vásquez‐Correa et al. 2019) using state‐of‐the‐art gated recurrent neural networks (GRNN). It has been shown to be accurate at evaluating the dysarthria severity of patients with motor speech disorders, such as patients affected with the neurodegenerative Parkinson's disease (Orozco‐Arroyave et al. 2020, Miller et al. 2020). The algorithm is a statistical approach to the human perception process and is based on the speech‐production mechanisms. It extracts the log Mel‐filterbank energy features from the acoustic signal and maps them into phonological features. Individual speech frames are represented by a vector with information about place and manner of articulation. This vector captures the posterior probability of a speech frame to belong to one or more phonological classes (e.g., plosive, nasal, bilabial). The model was originally designed for Spanish, and most of the extracted characteristics were related to vowels. Therefore, modifications were necessary in order to reliably characterize consonants, and extended to detect the Dutch‐specific phonological characteristics. The combination of place and manner of articulation and voiced characteristics were used to describe the consonants in Dutch. Hence, the algorithm was retrained using the language‐specific phonological features (Verhoeven 2005) (table 2) along with the read speech part of the Corpus Spoken Dutch (Oostdijk and Broeder 2003), which is an extensive corpus (903,043 words) of contemporary standard Dutch as it is spoken nowadays in the Netherlands and Dutch‐speaking part of Belgium. With the refinement and extension of the models, trained on healthy speech and capable to recognize all the phonological features, it can be used to detect more specific articulation errors/deficits in subjects with dysarthria. Accuracy, precision (positive predictive value), recall (sensitivity) and F‐score (harmonic mean of the precision and recall) performance measures to evaluate the models are offered in the results section.
TABLE 2

Consonants place and manner of articulation (Verhoeven 2005)

BilabialLabio‐dentalAlveolarPost‐alveolarPalatalVelarUvularGlottal
Plosivep bt d(c)k (g)(ʔ)
Nasalm(ɱ)n(ɳ)ŋ
Trill(r)ʀ
Fricativef vs z(ʃ) (ʒ)x ɣɦ
Approximantwj
Lateral approximantl(ʎ)

Note: Voiced consonants are on the right side.

Consonants place and manner of articulation (Verhoeven 2005) Note: Voiced consonants are on the right side. The newly trained algorithm extracts the phonological posterior probabilities from the acoustic signal and it is used to identify phonological errors present during the articulation training. However, the system does not make any decision with respect to the type and degree of error at the phoneme level (if the error was an addition, substitution, omission or distortion).

User interface

The training program (VAT) has three stages. In the first, the user selects the target sound that he or she will train (figure 2). Once the selection is made, a video instruction with the description and examples of the correct production of the sound (visual model) is given. This support is provided with frontal and lateral video recordings that show the correct way of sound production in an easy, friendly and interpretable way (figure 3). Finally, auditory modelling of the words is also offered when the set of exercises starts. The patient repeats the target words, and the system will automatically capture and analyse the signal. Subsequently, detailed visual feedback on the articulatory precision is provided. The graphs show whether or not the distinctive features of the target sound were present (figure 4). This detailed information, which is generally difficult to impossible to detect and quantify by the human ear, allows patients to work more precisely in the improvement of the articulatory movements. Misclassification or false detections of distinctive features could occur in the case of present background noise; therefore, it is recommended that patients practise in a quiet room and are encouraged to use a head‐mounted condenser microphone.
FIGURE 2

Sound selection. [Color figure can be viewed at wileyonlinelibrary.com]

FIGURE 3

Visual modelling. [Color figure can be viewed at wileyonlinelibrary.com]

FIGURE 4

Auditory modelling, exercises and feedback. [Color figure can be viewed at wileyonlinelibrary.com]

Sound selection. [Color figure can be viewed at wileyonlinelibrary.com] Visual modelling. [Color figure can be viewed at wileyonlinelibrary.com] Auditory modelling, exercises and feedback. [Color figure can be viewed at wileyonlinelibrary.com] For the general design of the user interface, the font, font size and button sizes are big, simple and clean, intended for elderly and patients with brain damage.

Instruments to measure user interaction, acceptability and usability

After two therapy sessions of 45 min each, all the participants evaluated the computer‐based therapy program. Usability, acceptability and interaction with the software were assessed through a questionnaire. A questionnaire designed specifically for this study was completed by end‐users after two sessions of practising with the VAT. The survey had the intention of evaluating software usability and acceptability from the user's point of view. The questionnaire was designed based on other questionnaires that addressed similar software‐related topics, such as the Software Usability Measurement Inventory (SUMI) (Kirakowski and Corbett 1993), the General Computer Skill Questionnaire (Klinkenberg et al. 2004) and the ASISTO questionnaire (Deruyter and Staessens 2018); the items were restructured focusing on the VAT. The set of questions addresses important topics that should be considered in order to succeed in future user engagement. It has a total of 32 items and is divided into five sections: Usability and operationality: this section is designed to evaluate the patient's perception of the software functionality as a tool, if they feel comfortable working with it, if they can interact independently with it and if they find it friendly and easy to use, in general. Program and layout: this section focuses on how the users experience the visual and auditory support received during them practising and how attractive and clear the layout is perceived. Exercises: this section is designed to evaluate the user's perception of the usefulness of the training therapy and if they feel that the selected set of exercises is helpful for improving articulatory skills and consequently speech intelligibility. Feedback: this section is designed to evaluate the perception of the visual feedback that is given during therapy. Experience in the use of multimedia: with this section we would like to have the tendency of multimedia usage in daily life. It will give some insights into how likely or often we can expect the use of the software. Each section contains several statements that participants have to score using a five‐point Likert scale, where a score of 1 corresponds to ‘completely disagree’ and a score of 5 corresponds to ‘completely agree’. The number of statements varies among sections; therefore, an average score is necessary for each section. Thereby, the questionnaire can have a maximum score of 25. The survey has also a section where the user might add suggestions or comments in order to improve the engagement and usability of the software. The survey is applied after two therapy sessions. For an overview of the questionnaire, see appendix A.

Training sessions

Before the start of the training, the goal of the virtual therapy and the steps of the program were explained in detail to every subject who participated in the study.

Training session 1

At the beginning of the first session, the DIA and SHI were conducted, followed by the supervised training moment. The selection of the target speech sounds to be trained was based on the error analysis of the DIA. The speech therapist started the computer and software and explained to the patient how to select the target speech sound and the level. Thereafter, it was demonstrated how to obtain auditory modelling of a training item, how to click to the next or preceding item, how to listen again to the patient's own speech production and how to go back to start. It was explained in detail how to understand and manage the given feedback. After the first session, all subjects received a printed version of the manual to consult at home if necessary.

Training session 2

The second session took place after a maximum of 2 days following the first session. During this session, patients worked completely autonomously with the VAT. It was expected to train with the same target sounds at the same level as during the first session. They were supposed to understand and manage the visual feedback in order to adjust their articulation. The questionnaire was administered at the end of this session. The supervisor was present for assistance if necessary.

Statistical analysis

Statistical analysis of the data was performed with the software program IBM Statistical Package for the Social Sciences v. 25 (IBM SPSS 25). The distribution of the data was checked for normality by means of the Shapiro–Wilk test, with a significance level of 0.05. Correlations between different variables were explored in order to investigate the relationship between the frequency of multimedia use, and between subsections of the questionnaire, and the overall evaluation of the program.

RESULTS

Algorithm performance

The trained Dutch models achieved high accuracy rates for the detection of all trained phonological classes. This statistical approach of the human perception process has detection rates > 88% (F‐score). During the learning process frequently occurring speech sounds of the language were modelled with higher accuracy than infrequently occurring sounds, but in general the results are considered accurate enough for the discrimination between each specific phonological class. Table 3 summarizes the performance of the models for the training dataset and the validation dataset (also called the development set). Precision, recall and F‐score performance measures are also offered.
TABLE 3

Performance of the models for Dutch

Phonological classAccuracy training setAccuracy validation setPrecisionRecall F‐score
Nasal0.9450.9350.9570.9350.941
Plosive0.9420.9100.9360.9100.917
Fricative0.9260.9360.9490.9360.940
Approximant0.9160.8890.9650.8890.917
Trill0.8990.9370.9690.9370.949
Labial0.9410.9170.9480.9170.927
Coronal0.8710.8840.8950.8840.887
Dorsal0.8850.9020.9340.9020.912
Glottal0.9470.9790.9910.9790.984
Voice0.9110.9350.9350.9350.935
Vowel0.8980.8920.9120.8920.896
Bilabial0.9560.9160.9580.9160.930
Labiodental0.9540.9750.9850.9750.978
Alveolar0.8740.8850.8980.8850.888
Postalveolar0.9750.9940.9980.9940.996
Pause0.9750.9780.9790.9780.978
Performance of the models for Dutch

Survey

Usability and operationality

Although some participants needed help with the mouse control, an average score of 4.14 out of 5 was given to the section ‘Usability and operationality’ (figure 5). This means that the patient's perception of the software functionality was generally found as easy to use and understand. Most participants agreed that the VAT is user‐friendly and can be used independently, but the answers to the statement ‘I would use the program daily at home’ were quite diverse. Half the subjects agreed or totally agreed, five were neutral and two disagreed. This could be related to the experience and daily use of multimedia, which was scored the lowest on the questionnaire, results which are presented below and that will be analysed later in this study. Figure 4 summarizes the average score of the items in this section.
FIGURE 5

Usability and operationality scored on a Likert scale (1 = ‘completely disagree’ and 5 = ‘completely agree’). [Color figure can be viewed at wileyonlinelibrary.com]

Usability and operationality scored on a Likert scale (1 = ‘completely disagree’ and 5 = ‘completely agree’). [Color figure can be viewed at wileyonlinelibrary.com]

Program and layout

The section ‘Program and layout’ received an average score of 4.46 out of 5 (figure 6). Most of the users found the program attractive, with a clear layout. The visual and auditory support for the target sound production was perceived as appealing and supportive. Finally, all subjects would recommend the use of the VAT to other patients.
FIGURE 6

Program and layout scored on a Likert scale (1 = ‘completely disagree’ and 5 = ‘completely agree’). [Color figure can be viewed at wileyonlinelibrary.com]

Program and layout scored on a Likert scale (1 = ‘completely disagree’ and 5 = ‘completely agree’). [Color figure can be viewed at wileyonlinelibrary.com]

Exercises

The section ‘Exercises’ received an average score of 4.23 out of 5 (figure 7). The users found the set of exercises to be clear and useful. This is particularly important because this type of repetitive and intensive training tends to be tiresome; therefore, it is essential that users find out the usefulness of the selected set of exercises that can help to improve their articulatory ability.
FIGURE 7

Exercises scored on a Likert scale (1 = ‘completely disagree’ and 5 = ‘completely agree’). [Color figure can be viewed at wileyonlinelibrary.com]

Exercises scored on a Likert scale (1 = ‘completely disagree’ and 5 = ‘completely agree’). [Color figure can be viewed at wileyonlinelibrary.com]

Feedback

The majority of users perceived the visual feedback given during therapy as clear and attractive. The section ‘Feedback’ received an average score of 3.96 out of 5 (figure 8). However, for this section the patients provided several suggestions to improve the stimulation and motivation of the user. They recommended also adding auditory feedback by the VAT in case of correct target production.
FIGURE 8

Feedback scored on a Likert scale (1 = ‘completely disagree’ and 5 = ‘completely agree’). [Color figure can be viewed at wileyonlinelibrary.com]

Feedback scored on a Likert scale (1 = ‘completely disagree’ and 5 = ‘completely agree’). [Color figure can be viewed at wileyonlinelibrary.com]

Experience in the use of multimedia

The section related to the participants’ use of multimedia received an average score of 3.06 out of 5 (figure 9), being the lowest of the questionnaire. These results clarify the score given before to the statement of daily usage of the software at home. Although the majority of the participants have some kind of multimedia device for entertainment, they do not use it every day. This is an interesting point that should be considered for the assignment of therapy frequency and duration.
FIGURE 9

Use of multimedia scored on a Likert scale (1 = ‘completely disagree’ and 5 = ‘completely agree’). [Color figure can be viewed at wileyonlinelibrary.com]

Use of multimedia scored on a Likert scale (1 = ‘completely disagree’ and 5 = ‘completely agree’). [Color figure can be viewed at wileyonlinelibrary.com] A moderate correlation was found between experience in the use of multimedia and the perceived quality of the program r = 0.554 (p = 0.04), which was measured as the total score of the questionnaire. Patients with higher knowledge or experience with the usage of multimedia tended to have a more positive perception of the quality of the software‐based therapy. However, patients with less experience or interest in multimedia also showed a positive attitude to the therapy.

Sections with a higher impact on the patient's perception of the program

From a practical and statistical point of view, most of the aspects that were addressed in the questionnaire are considered highly significant for the end‐users’ perception of the quality, acceptability and usability of the computer‐based therapy program. The biggest impact on the overall score was related to the feedback, followed by the exercises; also usability and operationality were highly significant for end‐users. The program and layout seem to be important for practical use, but not statistically significant in the perception of acceptability and usability of the VAT (table 4).
TABLE 4

Pearson correlation coefficients and significance level between the total score and the subscales of the questionnaire

Use and operationProgram and layoutExercisesFeedback
Total score

0.672**

(p = 0.009)

0.408

(p = 0.147)

0.761**

(p = 0.002)

0.785**

(p = 0.001)

Note: Correlation is significant at the *0.05 level (two‐tailed); and **0.01 level (two‐tailed).

Pearson correlation coefficients and significance level between the total score and the subscales of the questionnaire 0.672** (p = 0.009) 0.408 (p = 0.147) 0.761** (p = 0.002) 0.785** (p = 0.001) Note: Correlation is significant at the *0.05 level (two‐tailed); and **0.01 level (two‐tailed).

DISCUSSION

This pilot study evaluated the feasibility of a VAT that guides patients with dysarthria through a BArT program, combining speech technology, phoneme‐specific feedback and motor learning principles. It is generally accepted that for motor speech disorders, greater therapy frequency leads to better ultimate performance (Dobkin and Thompson 2000). The VAT addresses the need for an accessible program granting patients the opportunity to train their articulatory skills more intensively and frequently. The tool provides a set of exercises, visual and auditory modelling of target utterances, and adequate feedback with accurate performance. It has an accessible interface for patients with brain damage who often deal with other motor and sensory dysfunctions and have limitations in language and cognition. Additionally, it can considerably increase the capacity of speech therapy for patients with practical limitations (distance, financial restrictions, etc.). The results of the pilot study show that the VAT addresses the elementary aspects necessary to deliver the BArT successfully and to ensure the user's engagement. The study‐specific questionnaire addressed the functionality, layout, content of the speech material used and feedback given during training. Quantitative evaluation after two sessions of interaction with the system shows that this study cohort, consisting of patients with mild to moderate dysarthria, but with sufficient cognitive, language, visual and auditory skills, were in general positive about this new computer‐based therapy approach. These results were consistent with previously published virtual therapies in the field of speech and language therapy (Cole et al. 2007, Saz et al. 2009, Engwall et al. 2006, Eriksson et al. 2005). The majority of participants ensured that the VAT is attractive, user‐friendly and can be used independently. The usefulness of the software was evaluated by two different statements within the survey, whether they found the type of exercises to be useful as well as the given automatic feedback. Patients’ responses confirmed that it is a useful tool that can help to improve their communication. The visual feedback was found to be attractive, clear and easy to interpret; however, there are some suggestions for its improvement and for keeping the user motivated, which is a key factor for successful training sessions. The recommendations include the addition of auditory feedback with a more stimulating and encouraging response, as a suggestion, applause that is played when the target sound is correctly pronounced, which can make the success in target production more evident to the users. Previous research indicates that rewarding feedback after accurate expressions is crucial to maintain a high motivation (Halpern et al. 2012, Öster 2006). Participants also pointed out that providing an overall score of articulation will grant a better view of their progress. The section related to the use of multimedia in daily life shows that this specific group of participants (mainly elderly patients or with brain damage) are not highly motivated to use multimedia devices; however, most of them indicated a positive attitude toward the VAT program. This could be interpreted as they feel the program can be helpful to improve their articulatory skills while practising in their environment. The general outcomes from the questionnaire can also be essential for traditional face‐to‐face therapy methods of articulation. SLPs may use these outcomes to provide better, more efficient and individualized articulatory therapy, taking into account that a well‐structured and hierarchical programme of exercises is necessary and needs to be goal oriented, and the most valuable information they can offer to the patients is good and encouraging feedback. The involvement of patients in the early stage of the design and developmental process of the system is a way to guarantee the usability and acceptability of the program. The recommendations/suggestions made by the participants will lead to a suitable, easy‐to‐use and engaging virtual therapy.

LIMITATIONS

The study was restricted in time to two VAT therapy sessions interacting with the software program in a rather small cohort. This implies that the results of this pilot study are preliminary and cannot be generalized. In future validation studies, a larger number of patients should be included. Adding qualitative interview data and perspectives of caregivers and patients’ family members may also lead to additional insights. Finally, the questionnaire can be upgraded by adding an item related to how accurate the subjects felt the feedback was.

FUTURE RESEARCH

Future research will focus on the evaluation of therapeutic effectiveness after introducing the VAT in daily patient care. This will include a higher number of subjects and objective measures of longitudinal articulatory improvements. Also, future research will focus on the optimization of the acoustic models for the automatic error detection in dysarthric speech, the development of an objective acoustic measure of sound distortion, and the correlation analysis of automatic versus perceptual measurements of the degree of distortion. Qualitative evaluation of the system by a wider community that includes caregivers, SLPs and patients’ family members will also be investigated. The integration of exercises with higher linguistic levels will also be the focus of future work. The VAT will be released as an open‐source system, with a general public licence. It will also be communicated via professional organizations of speech therapists and patient groups.

CONCLUSIONS

The computer‐based articulation therapy for adults with dysarthria seems to be feasible and well accepted by the end‐users of this pilot study. Most participants have a positive attitude towards the use of the VAT. Even participants with limited computer skills/experiences are able to use the VAT independently. High accuracy was obtained on the performance of the models used for error detection, which contributes to adequate automatic feedback. However, the survey points out that improvements to the system are still necessary, and all the suggestions given by the stakeholders will be considered. The inclusion of patients in the design and development of these types of software is significantly important for usability, acceptability and engagement with e‐therapy, which appears to be a good option to increase the frequency and intensity of practice while supporting the traditional methods. The VAT offers clear applicability with an added value for the Flemish health facilities, including a positive impact for a specific group of patients.

DECLARATION OF INTEREST

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
TABLE A1

Applied questionnaire

Use and operation

The program is user‐friendly

The program can be used independently

The program provides clear instructions

It is clear how to select a target sound

It is clear how to play, pause, and stop the instructions about the target sound production

It is clear how to select levels and exercises

It is clear how to go to a previous or next exercise

It is clear how you can listen to your pronunciation

I would use the program daily at home

Program and layout

The home screen is attractive

The visual support for target sound production is clear

The visual support for target sound production is attractive

The size of the buttons is adequate/sufficient

The font is clear

The font size is adequate/sufficient

The computer screen size is adequate

I would recommend the program

Exercises

The exercises are clear

The type of exercise is useful

The number of exercises per target sound is sufficient

The degree of difficulty of the exercises matches my ability

Feedback

The feedback is attractive

The feedback is clear

The feedback is easy to interpret

The feedback is useful

The feedback is motivating

The feedback ensures that my articulation improves

Use of Multimedia

I use the internet regularly

I have one or more devices for multimedia (computer, smartphone, tablet, etc.)

I use multimedia for relaxation/entertainment

I use multimedia for work

I use multimedia to search for things/information

  25 in total

1.  Intensity of aphasia therapy, impact on recovery.

Authors:  Sanjit K Bhogal; Robert Teasell; Mark Speechley
Journal:  Stroke       Date:  2003-03-20       Impact factor: 7.914

Review 2.  Motor enrichment and the induction of plasticity before or after brain injury.

Authors:  Jeffrey A Kleim; Theresa A Jones; Timothy Schallert
Journal:  Neurochem Res       Date:  2003-11       Impact factor: 3.996

3.  Using visible speech to train perception and production of speech for individuals with hearing loss.

Authors:  Dominic W Massaro; Joanna Light
Journal:  J Speech Lang Hear Res       Date:  2004-04       Impact factor: 2.297

4.  Sentactics®: Computer-Automated Treatment of Underlying Forms.

Authors:  Cynthia K Thompson; Jungwon Janet Choy; Audrey Holland; Ronald Cole
Journal:  Aphasiology       Date:  2010-10       Impact factor: 2.773

Review 5.  The science and practice of LSVT/LOUD: neural plasticity-principled approach to treating individuals with Parkinson disease and other neurological disorders.

Authors:  Cynthia M Fox; Lorraine O Ramig; Michelle R Ciucci; Shimon Sapir; David H McFarland; Becky G Farley
Journal:  Semin Speech Lang       Date:  2006-11       Impact factor: 1.761

6.  Application of the ICF in reduced speech intelligibility in dysarthria.

Authors:  Allyson D Dykstra; Mark E Hakel; Scott G Adams
Journal:  Semin Speech Lang       Date:  2007-11       Impact factor: 1.761

7.  Using Virtual Technology to Promote Functional Communication in Aphasia: Preliminary Evidence From Interactive Dialogues With Human and Virtual Clinicians.

Authors:  Michelene Kalinyak-Fliszar; Nadine Martin; Emily Keshner; Alex Rudnicky; Justin Shi; Gregory Teodoro
Journal:  Am J Speech Lang Pathol       Date:  2015-11       Impact factor: 2.408

8.  Apkinson: the smartphone application for telemonitoring Parkinson's patients through speech, gait and hands movement.

Authors:  Juan Rafael Orozco-Arroyave; Juan Camilo Vásquez-Correa; Philipp Klumpp; Paula Andrea Pérez-Toro; Daniel Escobar-Grisales; Nils Roth; Cristian David Ríos-Urrego; Martin Strauss; Helber Andrés Carvajal-Castaño; Sebastian Bayerl; Luis Reinel Castrillón-Osorio; Tomas Arias-Vergara; Arne Künderle; Felipe Orlando López-Pabón; Luis Felipe Parra-Gallego; Björn Eskofier; Luis Felipe Gómez-Gómez; Martia Schuster; Elmar Nöth
Journal:  Neurodegener Dis Manag       Date:  2020-06-22

9.  Innovative technology for the assisted delivery of intensive voice treatment (LSVT®LOUD) for Parkinson disease.

Authors:  Angela E Halpern; Lorraine O Ramig; Carlos E C Matos; Jill A Petska-Cable; Jennifer L Spielman; Janice M Pogoda; Phillip M Gilley; Shimon Sapir; John K Bennett; David H McFarland
Journal:  Am J Speech Lang Pathol       Date:  2012-10-15       Impact factor: 2.408

10.  Computerized script training for aphasia: preliminary results.

Authors:  Leora R Cherney; Anita S Halper; Audrey L Holland; Ron Cole
Journal:  Am J Speech Lang Pathol       Date:  2008-02       Impact factor: 2.408

View more
  1 in total

1.  Automatic boost articulation therapy in adults with dysarthria: Acceptability, usability and user interaction.

Authors:  Viviana Mendoza Ramos; Juan C Vasquez-Correa; Rani Cremers; Leen Van Den Steen; Elmar Nöth; Marc De Bodt; Gwen Van Nuffelen
Journal:  Int J Lang Commun Disord       Date:  2021-07-06       Impact factor: 2.909

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.