| Literature DB >> 31697237 |
Ahmet Baki Kocaballi1, Shlomo Berkovsky1, Juan C Quiroz1, Liliana Laranjo1, Huong Ly Tong1, Dana Rezazadegan1, Agustina Briatore2, Enrico Coiera1.
Abstract
BACKGROUND: The personalization of conversational agents with natural language user interfaces is seeing increasing use in health care applications, shaping the content, structure, or purpose of the dialogue between humans and conversational agents.Entities:
Keywords: adaptive systems; conversational agents; conversational interfaces; customization; dialogue systems; health care; personalization
Mesh:
Year: 2019 PMID: 31697237 PMCID: PMC6873147 DOI: 10.2196/15360
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
An assessment scheme for personalization.
| Assessment categories | Description | ||
|
| How the user models needed by personalization are constructed. | ||
|
| Implicit | Information needed for user models is obtained automatically through the analysis of observed user activities and interactions with the system (eg, analyzing users’ conversational history to determine the suitable times to send a reminder). | |
|
| Explicit | Information needed for user models requires users’ active participation in obtaining the required information (eg, selecting the preferred times to receive a reminder). | |
|
| For whom to personalize. | ||
|
| Individuated | Personalization is targeted at a specific individual (eg, sending a reminder based on the unique profile of a single user). | |
|
| Categorical | Personalization is targeted at a group of people (eg, sending a reminder based on a shared profile of a group of users). | |
|
| What to personalize. | ||
|
| Content | The information itself (eg, alerts or reminders). | |
|
| User interface | How the information is presented (eg, using larger font sizes for elderly users or shortening prompts for experienced users). | |
|
| Delivery channel | The media through which information is delivered (eg, sending a reminder as a text message instead of a voice message). | |
|
| Functionality | What users can do with the system (eg, making different system functionalities available for patients and carers). | |
| Purpose | The purpose of personalization (eg, increasing user engagement or motivation). | ||
| Evaluation | The methods to evaluate personalization (eg, using interview questions or standardized questionnaires). | ||
| Outcomes | The outcomes in relation to personalization (eg, increased user engagement or motivation). | ||
aAdapted from Fan and Poole [7].
Figure 1Preferred Reporting Items for Systematic Reviews and Meta-Analyses flow diagram.
Personalization features of conversational agents in the included studies.
| Conversational agent (author, year) | CAa purpose | Automation (the basis for personalization) | Target (for whom to personalize) | What to personalize | |
| Content | User interface | ||||
| Tess (Fulmer et al, 2018) [ | Delivery of cognitive behavioral therapy to reduce symptoms of depression and anxiety in college students |
Explicit: Expressed emotions and mental health concerns of participants to provide personalized responses. Users' feedback and reported mood used to tailor interventions |
Individuated |
Personalized conversations based on emotions and mental health concerns Personalized therapeutic choices based on user feedback | NRb |
| Wysa (Inkster et al, 2018) [ | Wellbeing support app for users with symptoms of depression, aiming to build mental resilience and promote mental wellbeing |
Explicit: User responses to built-in assessment questionnaire and emotions expressed in a written conversation |
Individuated |
Personalized conversational pathways based on a user’s interaction, messages, and context | NR |
| Reflection Companion (Kocielnik et al, 2018) [ | Support reflection on personal physical activity data from fitness trackers |
Explicit: Users enter their behavior change goals and demographic data Implicit: Observed physical activity of the user |
Individuated |
Dialogues to encourage reflection Incorporating user goals into adaptive mini-dialogues Follow-up questions based on users’ earlier responses Visualization of past physical activity | NR |
| Relational Agent (Sillice et al, 2018) [ | Promote regular exercising and sun protection |
Explicit: Users provide their demographic information, exercising habits, sun protection behaviors and lifestyle goals Implicit: CA tracks user progress to send reminders if needed |
Individuated |
Acknowledgement of difficulties and tailored strategies to overcome these Feedback on progress and encouragement for achieving goals A weekly tracking chart to help participants monitor their exercise and sun protection behaviors Email reminders to support retention | NR |
| Woebot (Fitzpatrick et al, 2017) [ | Deliver cognitive-behavioral therapy for anxiety and depression to college students |
Explicit: Users enter their mood and goals |
Individuated |
Empathic responses tailored to the reported mood Tailoring of support content depending on the reported mood Daily prompting messages to initiate a conversation Weekly charts depicting the reported mood and textual summary | NR |
| Social Skills Trainer (Tanaka et al, 2017) [ | Social skills training for people with autism spectrum disorders |
Implicit: CA analyzes the user's audio-visual features, facial expression (smile), and head position to determine its feedback and then performs feature selection |
Individuated |
Personalized score showing similarity to a role model with respect to 10 features Encouraging comments to reinforce motivation, based on features closest to the model Comments on the points that need improvement, based on features dissimilar to the model Homework challenges for participants to complete on their own time throughout the week | NR |
| mASMAAc (Rhee et al, 2014) [ | Facilitate asthma symptom monitoring, treatment adherence, and adolescent-parent partnership |
Explicit: Users enter symptoms, activity level, and use of rescue and control medications |
Individuated |
Automated inquiries and reminders sent according to user-defined preferences on monitoring symptoms and managing medications and activity Processing of and responses to user-initiated messages at any time Daily report summarizing symptoms, activity, and use emailed to parents | NR |
| Chris (Hudlicka, 2013) [ | Embodied CA that provides mindfulness training and coaching |
Explicit: Users answer questions asked by the CA and set preferences via multiple-choice questions |
Individuated |
CA’s facial expressions and its responses adapting to the users’ learning needs and motivational state CA's affective reaction adapting to the users' utterances Conversational expressions communicating mental state Customized advice about meditation practice, based on the expressed concerns | Using didactic, relational, or motivational conversational styles according to the user models |
| DI@l-log (Harper et al, 2008; Black et al, 2005) [ | Voice logbook to document home monitored data by diabetes patients |
Explicit: Users provide weight, blood sugar and blood pressure values |
Individuated |
An alert feature generating a verbal warning if readings are too high Personalized feedback to patients on their current progress | NR |
| Pain Monitoring Voice Diary (Levin and Levin, 2006) [ | Real-time collection of information from patients for health, behavioral, and lifestyle studies and monitoring |
Explicit: Users answer a series of questions about their pain (location, type, intensity, etc) Implicit: CA utilizes previous sessions to provide personalized content and conversational style |
Individuated Categorical (novice and experienced users) |
Content (what data is collected) and style (how it is collected) of the reporting session Adaptive question-asking (additional questions for follow-ups to sessions with high levels of pain) Adaptive interruptions to better support experienced users | Adaptive conversational style (eg, shorter question formats for follow-up sessions) |
| Intelligent dialogue system (Giorgino et al, 2004; Azzini et al, 2003) [ | Home care and data acquisition from hypertension patients |
Explicit: Users answer questions about heart rate, pressure, weight, compliance, and more Implicit: CA changes its behavior depending on the progress of the current call and the clinical history of the caller |
Individuated |
The questions to be asked were determined by user profiles Gives advice on recommended health behavior and next visits Issues alerts and prompts | NR |
aCA: conversational agent.
bNR: not reported.
cmASMAA: mobile phone-based asthma self-management aid.
Personalization purpose, evaluation, and outcomes in the included studies.
| Conversational agent (author, year) | Personalization | ||
| Purpose | Evaluation | Outcomes | |
| Tess (Fulmer et al, 2018) [ |
To improve depression and anxiety symptoms To provide more engaging and convenient user experience To provide appropriate response and strategies based on the users’ reported emotion and health concerns |
Questionnaires to measure depression (PHQ-9a) [ Custom-built user satisfaction questionnaire Number of messages to measure user engagement |
Significantly lower depression ( 86% (43/50) of participants satisfied with CAe (sm) Comparable levels of daily engagement (bmf) |
| Wysa (Inkster et al, 2018) [ |
To develop positive self-expression and create a responsive self-reflection environment To encourage users to build emotional resilience skills |
Questionnaire to measure depression (PHQ-9) Thematic analysis of the responses to the in-app feedback questions User engagement through analysis of raised objections and thematic analysis of in-app feedback |
Significant reduction in depression scores in both high ( 67% (191/282) of users reporting on positive app experience (sm) More than 99% (6555/6611) of detected objections were correct (bm) |
| Reflection Companion (Kocielnik et al, 2018) [ |
To trigger deeper reflection, which would increase motivation, empowerment, and adoption of new behavior To provide engaging, novel, and diverse conversations around reflection |
Questionnaires to measure health awareness [ Willingness to use the system, number, and length of responses as measures of engagement Responses to mini-dialogues Semi-structured post-study interviews |
Significant increases in habitual action ( Prolonged use of CA (additional two weeks) by half of the participants (16/33) with an avg of 98.4-character response length in this period (bm) High response rates: 96% (443/462) of initial and 90% (386/429) of follow-up questions (bm) Mini-dialogues successfully supporting discussions on awareness related to goal accomplishment, self-tracking data, and trends in behaviour (nqi, sm) Interviews indicating an increase in awareness, mindfulness, and motivation; understanding of alternatives and actions; and newly discovered insights (sm) |
| Relational Agent (Sillice et al, 2018) [ |
To increase user engagement and promote more effective behavior change To monitor exercise and sun protection behavior To provide strategies to overcome the reported barriers |
Interviews to assess user experience and a 10-point Likert scale to measure satisfaction with interventions |
The levels of satisfaction ranged between 7 and 10 on a scale of 1 to 10 (sm) Most participants reporting on: (1) positive interactions with the CA (32/34; 94%); (2) tailored feedback supporting regular exercising and sun protection behaviors (29/34; 85%); and (3) email reminders helping to remain on track with the program (23/34; 68%; sm) |
| Woebot (Fitzpatrick et al, 2017) [ |
To engage individuals with CA through managing conversation tailored to the reported mood |
Questionnaires to measure depression (PHQ-9), anxiety (GAD-7), and affect (PANAS) Custom-built questionnaire to measure user satisfaction, emotional awareness, learning, and relevancy of content |
Significant reduction in depression symptoms ( Significantly high level of overall satisfaction ( |
| Social Skills Trainer (Tanaka et al, 2017) [ |
To provide personalized feedback aimed at improving narrative social skills |
Experienced human social skills trainer assessed the participants' narrative skills |
Improvements in the overall narrative and social skills (Study 1, |
| mASMAAj (Rhee et al, 2014) [ |
To make the system more appealing and elicit greater and longer interest in and use of the system |
Six routine asthma-diary questions Focus group interviews to evaluate user experience with CA |
Improved self-management, treatment adherence, accessibility of advice, awareness of symptoms, and sense of control (nq, sm) CA was found to be easy-to-use, convenient, and appealing (nq, sm) |
| Chris (Hudlicka, 2013) [ |
To deepen the relationship with the user To support pedagogical strategies necessary for effective training of mindfulness meditation To provide the coaching required to initiate and maintain regular practice To provide interactions for maintaining motivation via empathic dialogue and customized advice |
Custom-built questionnaires to assess the overall experience, meditation frequency, knowledge of mindfulness, sense of self-efficacy, and stages of change within the transtheoretical model of change |
Improved outcomes with CA group compared to a self-administered program: (1) more frequent and longer mindfulness training sessions ( Neutral to mildly positive feedback on CA's ability to provide customized feedback (0.3 on a –2 to +2 Likert scale; sm) |
| DI@l-log (Harper et al, 2008; Black et al, 2005) [ |
To provide personalized feedback on the patient's health status and increase their engagement |
Task completion rate and time Number of personalized alerts Qualitative interviews |
92.2% (190/206) successfully completed calls, shortening calls over time, and effective alerts leading to 12 therapeutic interventions (bm) [ 90.4% (38/42) successfully completed calls, users’ appreciation of the personalization and reports on empowerment, peace-of-mind, and sense of care (bm, sm) [ |
| Pain Monitoring Voice Diary (Levin and Levin, 2006) [ |
To shorten the dialogue sessions To provide the users a feeling of continuity To have flexible and adaptive support for different types of users |
Session length, completion rate, and turn duration Ratio of prompt interruptions by users |
97% (171/177) of sessions completed with 98% (849/859) input accuracy (bm) Shortening dialogues over time (avg 1.2 seconds over 7 sessions; bm) More prompt-interruptions by the experienced users (73% of the prompts) compared to the novice users (59% of the prompts; bm) |
| Intelligent dialogue system (Giorgino et al, 2004; Azzini et al, 2003) [ |
To improve the quality of system dialogues To increase patient compliance with guidelines |
Reliability and recognition error rate Time spent in learning to use the system |
Recognition rate up to 41%-81% (bm) Dialogue time of 3.3-5.9 minutes, with 80% (74/93) of the expert users’ dialogues achieving conclusion (bm) |
aPHQ-9: Patient Health Questionnaire 9-item scale.
bGAD-7: Generalized Anxiety Disorder 7-item scale.
cPANAS: positive and negative affect schedule 20-item scale.
dsm: self-reported measure.
eCA: conversational agent.
fbm: behavioral measure.
gFMI: Freiburg Mindfulness Inventory.
hRQ; Reflection Questionnaire.
inq: not quantified.
jmASMAA: mobile phone-based asthma self-management aid.