| Literature DB >> 35531568 |
Faruk Lawal Ibrahim Dutsinma1, Debajyoti Pal1, Suree Funilkul2, Jonathan H Chan1.
Abstract
Voice assistants (VA) are an emerging technology that have become an essential tool of the twenty-first century. The VA ease of access and use has resulted in high usability curiosity in voice assistants. Usability is an essential aspect of any emerging technology, with every technology having a standardized usability measure. Despite the high acceptance rate on the use of VA, to the best of our knowledge, not many studies were carried out on voice assistants' usability. We reviewed studies that used voice assistants for various tasks in this context. Our study highlighted the usability measures currently used for voice assistants. Moreover, our study also highlighted the independent variables used and their context of use. We employed the ISO 9241-11 framework as the measuring tool in our study. We highlighted voice assistant's usability measures currently used; both within the ISO 9241-11 framework, as well as outside of it to provide a comprehensive view. A range of diverse independent variables are identified that were used to measure usability. We also specified that the independent variables still not used to measure some usability experience. We currently concluded what was carried out on voice assistant usability measurement and what research gaps were present. We also examined if the ISO 9241-11 framework can be used as a standard measurement tool for voice assistants.Entities:
Keywords: ISO 9241-11 framework; Systematic literature review; Usability; User experience; Voice assistants
Year: 2022 PMID: 35531568 PMCID: PMC9063617 DOI: 10.1007/s42979-022-01172-3
Source DB: PubMed Journal: SN Comput Sci ISSN: 2661-8907
Current literature reviews
| # | Article name | Summary | Limitations | Usability focus |
|---|---|---|---|---|
| [ | Smart Home Voice Assistants: A Literature Survey of User Privacy and Security Vulnerabilities | The study explores the potential use vulnerabilities encountered while using the voice assistant. The studies looked at the vulnerabilities, associated attack vectors, and possible mitigation measures that users can take to protect themselves during the use of voice assistant | Privacy and vulnerability are not the primary focus in usability | Personal Smart Home use |
| [ | Intelligent personal assistants: A systematic literature review | The natural language interfaces allow the human–computer interaction by the translation of the human intention in the controls of the devices, the analysis of the speech or the gestures of the user. The article looked at the major trends, critical areas and challenges of an intelligent personal assistant. The study also proposed a taxonomy for IPA classification. The method used the population, intervention, comparison, outcome, and context (PICOC) criteria | The study did not conduct a thorough review of what was done with respect to the usability of the voice assistant | General use |
| [ | Virtual Assistants for Learning: A Systematic Literature Review | The motivation, commitment and decreasing interest of students in the learning process has always existed, contributing to increased failures and dropouts. This can be attributed caused due to the difficulties with time management. The growing number of students in higher education makes it impossible to provide individual tutoring and support to each student. This paper systematically examines the use of virtual assistants in tertiary education It focuses on the technology which fuels them, their characteristics and their impact in the learning process | The Study focused on voice assistants used only within an educational environment that motivates users | Education |
| [ | Voice-Based Conversational Agents for the Prevention and Management of Chronic and Mental Health Conditions: Systematic Literature Review | Chronic and mental diseases are increasingly prevalent throughout the world. As devices in our everyday lives offer more and more voice-based self-service, voice assistant can support the prevention and management of these conditions. This study highlights the current methods used in the evaluation of health interventions for the prevention and management of chronic and mental health conditions delivered through voice assistant | The study only focused on voice assistants used in the health environment alone | Health |
| [ | Tourists’ Attitudes toward the Use of Artificially Intelligent (AI) Devices in Tourism Service Delivery: Moderating Role of Service Value Seeking | This study examines tourist attitudes towards the use of voice assistants in relatively more utilitarian or hedonic (air and hotel) tourism services. The results of the study suggest that tourism acceptance of VA is influenced by social influence, hedonistic motivation, anthropomorphism, expectation of performance and exertion, and emotions towards artificially intelligent devices. These results suggest that while the use of voice Assistants in the provision of functional services is acceptable, the use of AI devices in the delivery of hedonic services could backfire | The study was not on usability, to be specific, but on adopting voice assistants in the tourism environment | Tourism |
| [ | Exploring How Older Adults Use a Smart Speaker–Based Voice Assistant in Their First Interactions: Qualitative Study Exploring How Older Adults Use a Smart Speaker–Based Voice Assistant in Their First Interactions: Qualitative Study | Smart speaker-based voice assistants promise support for the aging population, with the benefits of hands-free and eye-free interaction to process applications. This study explores how older adults experience and react to a voice assistant when they first interact with that person. The study discusses design implications that can positively influence older adults using voice assistant, including helping better understand how a voice assistant work and tailored to the needs of older adults | The Study group focused on was only older adults | General Use |
| [ | A Meta-Analytical Review of Empirical Mobile Usability Studies | This document provides a usability assessment framework tailored to the context of a mobile IT environment. The study conducted a qualitative meta-analysis of more than 100 empirically based mobile usability studies. This study included the contextual factors studied, the dimensions of core and peripheral usage measured. Furthermore, open and unstructured tasks are under-utilized, and the effects of interaction between interactivity and complexity warrant further study | The impacts of User characteristics and environment on usability were not explored in the study | Mobile computing |
| [ | Evaluation of COVID-19 Information Provided by Digital Voice Assistants | Digital voice assistants are widely used to search for health information during COVID-19. With the rapidly changing nature of COVID-19 information, there is a need to assess the COVID-19 information provided by voice assistants to meet consumer needs and prevent disinformation. The goal of this study is to evaluate the COVID-19 information provided by voice assistants in terms of relevance, accuracy, usability and reliability. The study found that information about this pandemic is evolving rapidly and that users must use good judgment when obtaining COVID-19 information from voice assistants | The study focused only on voice assistants used in Covid-19 related issues | Health |
| [ | The human side of human-Chatbot interaction: A systematic literature review of ten years of research on text-based chatbot | Over the last ten years there has been a growing interest around text-based chatbot, software applications interacting with humans using natural written language. However, despite the enthusiastic market predictions, ‘conversing’ with this kind of agents seems to raise issues that go beyond their current technological limitations, directly involving the human side of the interaction. This study suggests a number of research opportunities that could be explored over the next few years | The study focused only on chatbot with textual modality. Moreover, chatbot use a Graphic User Interface that is not present in voice assistants | General Use |
| [ | Voice in Human–Agent Interaction: A Survey | Social robots, conversational agents, voice assistants and other embodied AIs are increasingly a characteristic of daily life. The connection between these different types of intelligent agents is their ability to interact with people by voice. The voice becomes an essential mode of embodiment, communication and interaction between IT operators and end users. This study presents a meta-synthesis of the voice of agents in the conception and experience of agents from a man-centered point of view: voice assistant | The study did not use the ISO 9241–11 framework as a reference in their measurement scale | General Use |
| [ | Usability of Chabot’s: A Systematic Mapping Study | The use of chatbot has increased considerably in recent years. As a result, it is essential to integrate conviviality into their development. For this reason, it is essential to integrate conviviality in their development. The study identifies the state of the art in the conviviality of chatbot and the applied techniques of human–computer interaction, to analyze how to assess the conviviality of chatbot | The study focused only on chatbot with textual modality. Moreover, chatbot use a Graphic User Interface that is not present in voice assistants | General Use |
| [ | A Literature Review On Chatbot In Healthcare Domain | The study highlighted Chabot used in the healthcare environment. Also, it compares the techniques such as NLU, NLG, and ML used in chatbot development | The study deals with chatbot with textual modality. Also, the study deals with chatbot used only in a healthy environment | health |
| [ | Review of Chatbot Design Techniques | The study reviewed the techniques and factors considered when designing a chatbot.Also it highlighted how chatbot worked and what are the type of approaches that are available for chatbot development | The study focuses on chatbot with textual modality, which is different from a voice assistant | Commerce |
| [ | A Systematic Literature Review of Medical Chatbot Research from a Behavior Change Perspective | The study examined the literature on how people feel about using a medical chatbot in medical communication services. Moreover, The study recommended five design-orientation and highlighted the behavioral aspects such as acceptance, usage, and effectiveness when using chatbot | The study focuses on chatbot with textual modality, which has different factors than voice assistants | Health |
| [ | A review of chatbot in education: Practical steps forward | The study focused on Chatbot applied within an educational environment; it highlighted how Chatbot are currently being used in a broader educational environment. Moreover, the study also recommended how Chatbot can be applied to enhance students learning experience | The study focuses on chatbot, with textual modality, with different factors than voice assistants | Education |
| [ | Human-like communication in conversational agents: a literature review and research agenda | The study identified the voice assistant human-like behaviors that have the most effect on relational outcomes during communication | An in-depth analysis of user and conversation assistants attributes was not carried out. Moreover, the study only focused on voice assistants used in management alone | Management |
Fig. 1Article selection process
Quality assessment checklist
| Checklist | Definition |
|---|---|
| C1 | Are the study aims and objectives clearly stated |
| C2 | Is the article well designed to achieve these aims? |
| C3 | Are the independent variable in the study clearly defined? |
| C4 | Are the dependent variable in the study clearly defined? |
| C5 | Is the study discipline stated clearly? |
| C6 | Are the data collection methods clearly stated |
| C7 | Does the study explain the reliability and validity of the measures? |
| C8 | Are the analysis techniques described adequately? |
| C9 | Are the users/participants’ numbers stated clearly? |
| C10 | Do the results add to the literature? |
List of compiled articles
| # | Article name | Voice assistant type | Usability measure | Years |
|---|---|---|---|---|
| 1 | An Exploration of Speech-Based Productivity Support in the Car [ | Car Interface | Effectiveness | 2019 |
| Exploring Effects of Conversational Fillers on User Perception of Conversational Agents [ | Smart Speaker | Effectiveness, machine Voice(perceived intelligence) | 2019 | |
| 2 | ||||
| 3 | I Almost Fell in Love with a Machine”: Speaking with Computer Affects Self-disclosure [ | Software Interface | Trust | 2019 |
| 4 | Clarifying False Memories in Voice-based Search [ | Smart Speaker | Satisfaction, efficiency, cognitive load | 2019 |
| 5 | The Effects of Anthropomorphism and Non-verbal Social Behavior in Virtual Assistants [ | Smart Speaker Humanoid | machine Voice(perceived humanness, social presence), cognitive load(attention) | 2019 |
| 6 | An End-to-End Conversational Style Matching Agent [ | Smart Speaker | Trust | 2019 |
| 7 | Tandem Track: Shaping Consistent Exercise Experience by Complementing a Mobile App with a Smart Speaker [ | Smart Speaker Software Interface | Efficiency, Effectiveness | 2020 |
| 8 | Mapping Perceptions of Humanness in Intelligent Personal Assistant Interaction [ | Smart speaker Software Interface | Machine voice(perceived Humanness), Effectiveness | 2019 |
| 9 | Pattern of Gaze in Speech Agent Interaction [ | Humanoid | Machine voice (Social presence), cognitive workload | 2019 |
| 10 | Conversational Interfaces for a Smart Campus: A Case Study [ | Smart Speaker Software Interface | Effectivity | 2020 |
| 11 | Mental Workload and Language Production in Non-Native Speaker IPA Interaction [ | Smart Speaker Software Interface | Cognitive Load, Satisfaction | 2020 |
| 12 | User Experience of Alexa when controlling music – comparison of face and construct validity of four questionnaires [ | Smart speaker | User satisfaction | 2020 |
| 13 | Machine Body Language: Expressing a Smart Speaker’s Activity with Intelligible Physical Motion [ | Humanoid | Machine Voice (Perceived humanness) | 2020 |
| 14 | Measuring the anthropomorphism, animacy, likeability perceived intelligence and perceived safety of robots [ | Humanoid | Machine Voice (Perceived humanness, Anthropomorphism) | 2008 |
| 15 | At Your Service: Designing Voice Assistant Personalities to Improve Automotive [ | Car interface | Attitude(Likeability, acceptance) | 2019 |
| 16 | Hey, Siri”, “Ok, Google”, “Alexa”. Acceptance- Relevant Factors of Virtual Voice-Assistants [ | Smart Speaker Software Interface | Attitude (Trust acceptance) | 2019 |
| 17 | User experience with smart voice assistants: the accent perspective [ | Smart Speaker Software Interface | User satisfaction | 2019 |
| 18 | Empathy is all you need: How a conversational agent should respond to verbal abuse [ | Software Interface | Effective, User satisfaction,Machine voice (social presence) | 2020 |
| 19 | Gendered Voice and Robot Entities: Perceptions and Reactions of Male and Female Subjects [ | Humanoid | User satisfaction, attitude, effectiveness | 2009 |
| 20 | What If Conversational Agents Became Invisible? Comparing Users’ Mental Models According to Physical Entity of AI Speaker [ | Smart speaker | Attitude(trust), machine Voice(Anthropomorphism) | 2020 |
| 21 | Similarity is more important than expertise: Accent effects in speech interfaces [ | Smart Speaker | Effectiveness, attitude(Trust), Efficiency, satisfaction | 2007 |
| 22 | Can Computer-Generated Speech Have Gender? An Experimental Test of Gender Stereotype [ | Software Interface | Attitude, User satisfaction | 2020 |
| 23 | Designing Social Presence of Social Actors in Human Computer Interaction [ | Software Interface | satisfaction | 2003 |
| 24 | Improving Automotive Safety by Pairing Driver Emotion and Car Voice Emotion [ | Software Interface | Effectiveness, Efficiency | 2005 |
| 25 | Designing Emotional Expressions of Conversational States for Voice Assistants: Modality and Engagement [ | Humanoid | Cognitive load, Attitude | 2018 |
| 26 | The Use of Voice Input to Induce Human Communication with Banking Chabot’s [ | Smart Speaker | Attitude | 2018 |
| 27 | Face Value? Exploring the Effects of Embodiment for a Group Facilitation Agent [ | Smart Speaker Software Interface | Attitude | 2018 |
| 28 | Trust in artificial voices: A “congruency effect” of first impressions and behavioral experience [ | Humanoid | Attitude | 2018 |
| 29 | Children Asking Questions: Speech Interface Reformulations [ | Smart Speaker | Cognitive load | 2018 |
Fig. 2Year of publication of selected articles
Fig. 3Embodiment of Voice assistant used in selected studies
Fig. 4Categories of independent variable use over the years
Independent variables and their categorization
| Category definition | Independent variable | Instances | Applications | Environment |
|---|---|---|---|---|
Voice The voice category comprised of independent variables that are associated with the voice assistants, these are attributes that the voice assistants possess) | Voice personalities | (Energetic vs Subdued), (Introvert and extrovert) | A study Paired the driver's emotions with that of the Car Voice Emotion state (Energetic and Subdued) to test the effectiveness of similarity between voice and user personality [ | Simulation Experiment, Controlled Environment, |
| Voice gender | Male vs Female | Studies compared different gender voices (Male and Female) to measure social interaction and trust. Studies showed male voice has a more dominating effect on users than female voice [ | Controlled Environment, Free real environment | |
| Voice Accent | Standard Southern British English accent VS Liverpool accent Vs Birmingham accent Vs synthetic voice, American Accent vs Swedish Accent, Native English speaker vs non-English speaker | Participants create trust expectancy based on the voice accent. The participants tend to trust information with a similar accent, more knowledgeable, sophisticated voice Assistants [ | Controlled Environment, | |
People The people category comprised of independent variables that are associated with the users, these are attributes that the users can have | People gender | Male vs Female | Studies showed that Males and females view voice assistants differently in different aspects. Both genders have different takes in the form of embodiment of the voice assistants. Moreover, women trust voice assistants with a female voice. However, in a situation where there is a need for convincing the male voice assistants is more efficacious [ | Controlled Environment, Free real environment, mixed environment |
| Personality | Introvert vs Extrovert, happy vs upset | A study showed that a person's emotional state or personality could be affected the personality of the voice assistant. [ | Simulation Experiment, Controlled Environment, | |
| Query expression | Abuse (Insult, Threat, Swearing) | A study instructed the user to insult the voice assistants while communicating with it, and the VA's response affected the user's outlook and involved usability [ | Controlled Environment | |
| Experience | UX metric, Self-Efficacy | The study Measured the user face validity and construct validity by correlating UX scores of questionnaires with each other. Another study shows that Participant self-efficacy and experience affect the trust, privacy and language performance of the Voice Assistant [ | Controlled Environment, survey | |
| Voice accent | American Accent vs Swedish Accent, Native English speaker vs non-English speaker | Participants tend to trust information with a similar accent, then more knowledgeable content, English native speakers do exhaust more mental models when interacting with voice assistants [ | Controlled Environment, Free real environment, mixed environment | |
Task characteristic The Task characteristic Comprised of independent variables that are associated with tasks that it’s expected the user to carry out during the interaction, this also include the modality of the task | Modality | Voice mode, Textual mode, VA Facial Expression mode. (Smiley) Mixed Interface | A study used modality to test the social presence of the VA. The study shows participants feel a strong social presence when textual modality personality matches the voice personality. Another study showed that nonverbal emotional expressions such as Text box movement and VA Facial Expression mode (Smiley) affect user engagement [ | emotional expression design experiment interactive task, controlled Environment, |
| Context | Interactive Task, Drawing Task, Executable Task, Driving simulation task, auditory Task Controlling device Volume, audio speech to text | A study used the speech to text as a task on users during driving. The study measured driver engagement and concentration during driving.[ | emotional expression design experiment interactive task, controlled Environment, free real life environment, simulation | |
Conversational Style This is the nature of the conversation from either the user during query or the response of the voice assistants | Response type | Empathetic (Avoidance vs Empathy vs Counterattack) Clarifying Query (No modification vs direct Modification vs negatively clarified) Conversational Fillers (“um”,huh, uh) | The VA response affects the user usability experience, and A Study showed that When VA are insulted, their response type affects the participant's emotional engagement and attitude [ | Controlled Environment, |
| Communication Form | High Consideration(indirect) VS High Involvement (direct) | A study used participant High Consideration and High involvement linguistic style to realize. It is effective when used with a similar voice assistant's linguistic style. [ | Controlled Environment, Mixed Environment | |
Anthropomorphic cues These are independent variables on voice assistants that exhibit human attributes and intelligence, this make the user perceive the voice assistants as human | Speech agents’ Personification vs Speech agent personalization | A study compared VA personation, personalization, and neither to measure users' trust and engagement when used by children and adults. The result showed the personalized VA has the highest concentration and trust [ | Controlled Environment, | |
| Embodiment type (audio Vs smart speaker), or (gaze vs no gaze), humanoid robot, Smart Speaker vs Anthropomorphic Robot (AMR). vs The Anthropomorphic Social Robot (AMSR) | Numerous studies have used embodiment type to measure usability; a study compared a VA with gaze with another VA without gaze to measure the user anthropomorphism. Another study compared physical smart speakers with the absence of speakers but just voice to test the user trust and engagement [ | Controlled Environment, | ||
Fig. 5Technique used in our studies over the span period of time
Fig. 6Usability measurement used over the years on our compiled articles
Fig. 7Percentage of ISO 9241–11 framework usability measures and non ISO 9241–11
Relationship between independent variables and ISO 9241–11 framework measurement
| Dependent variables | Effectivity | Efficiency | Satisfaction | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Independent variables | Productivity | Performance | Value | Learnability | Optimization | Ease of use | Feasibility | Decision making | User experience | Continued use | Conformity |
| Voice Assistants | |||||||||||
| Personalities | x | x | x | x | x | ||||||
| Gender | x | x | x | x | x | ||||||
| Accent | x | x | x | x | x | ||||||
| People | |||||||||||
| Gender | x | x | x | x | |||||||
| Personality | x | x | x | ||||||||
| Query expression | |||||||||||
| Experience | x | x | x | ||||||||
| Accent | x | x | x | x | x | x | |||||
| Task characteristics | |||||||||||
| Modility | x | x | x | x | x | x | x | ||||
| Context | x | x | |||||||||
| Communication type | |||||||||||
| Response type | x | x | x | x | x | ||||||
| Conversational type | x | x | x | x | x | x | |||||
| Anthropomorphism | |||||||||||
| Embodiement type | x | x | x | x | x | x | |||||
| Humanoid/robott | x | x | |||||||||
| Smart Speaker, Robot,Anthropormorphic Robot | x | x | x | x | x | x | |||||
Relationship between independent variables and non- ISO 9241–11 framework measurement
| Dependent variables | Attitude | Machine voice | Cognitive load | |||||
|---|---|---|---|---|---|---|---|---|
| Independent variables | Trust | Likeability | Acceptance | Perceived intelligence | Perceived humanness | Social presence | Mental workload | Attention |
| Voice assistant | ||||||||
| Personalities | x | x | x | x | ||||
| Gender | x | x | x | x | x | |||
| Accent | x | x | x | x | x | x | ||
| People | ||||||||
| Gender | x | x | x | |||||
| Personality | x | x | x | x | x | |||
| Query expression | x | x | ||||||
| Experience | x | x | x | x | ||||
| Accent | x | x | x | x | x | x | ||
| Task Characteristics | ||||||||
| Modality | x | x | x | x | x | x | x | |
| Context | x | x | x | x | ||||
| Communication Type | ||||||||
| Response type | x | x | x | x | x | x | ||
| Conversational type | x | x | x | x | x | x | ||
| Anthropomorphism | ||||||||
| Embodiment type | x | x | x | x | x | x | x | |
| Humanoid/robot | x | x | x | x | x | |||
| Smart speaker, robot, anthropomorphic robot | x | x | x | x | ||||