| Literature DB >> 35408238 |
Abdullah Bin Sawad1, Bhuva Narayan2, Ahlam Alnefaie1, Ashwaq Maqbool3, Indra Mckie2, Jemma Smith4, Berkan Yuksel1, Deepak Puthal5, Mukesh Prasad1, A Baki Kocaballi1.
Abstract
This paper reviews different types of conversational agents used in health care for chronic conditions, examining their underlying communication technology, evaluation measures, and AI methods. A systematic search was performed in February 2021 on PubMed Medline, EMBASE, PsycINFO, CINAHL, Web of Science, and ACM Digital Library. Studies were included if they focused on consumers, caregivers, or healthcare professionals in the prevention, treatment, or rehabilitation of chronic diseases, involved conversational agents, and tested the system with human users. The search retrieved 1087 articles. Twenty-six studies met the inclusion criteria. Out of 26 conversational agents (CAs), 16 were chatbots, seven were embodied conversational agents (ECA), one was a conversational agent in a robot, and another was a relational agent. One agent was not specified. Based on this review, the overall acceptance of CAs by users for the self-management of their chronic conditions is promising. Users' feedback shows helpfulness, satisfaction, and ease of use in more than half of included studies. Although many users in the studies appear to feel more comfortable with CAs, there is still a lack of reliable and comparable evidence to determine the efficacy of AI-enabled CAs for chronic health conditions due to the insufficient reporting of technical implementation details.Entities:
Keywords: chatbot; conversational agents; dialogue systems; relational agents
Mesh:
Year: 2022 PMID: 35408238 PMCID: PMC9003264 DOI: 10.3390/s22072625
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Overview and characteristics of included studies.
| Author, Year | Study Location | Type of Chronic Condition | Study Aim | Study Type and Methods | Participants’ Characteristics |
| |
|---|---|---|---|---|---|---|---|
| User Experience | Health Related Measures | ||||||
| Azzini et al., 2003 | Italy | Hypertension (patients with essential hypertension) | Data collection, developing a prototype home monitoring system. | Fifteen patients with no information about age, gender and duration. | Not reported | Not reported | |
| Baptista et al., 2020 | Australia–New South Wales, Queensland, Victoria, and Western Australia | Diabetes–Type 2 (T2D) | Self-management, education, and support. | Ninety-three patients from My Diabetes Coach app. | User experience feedback is as the following: |
Improving self-management. Older people have more interaction ( Participants who were interviewed showed more interactions with Laura ( | |
| Beaudry et al., 2019 | America-Vermont | Chronic condition (teenagers with pediatric Inflammatory Bowel Disease, Cardiology, or Type 1 Diabetes) | Learning self-care for teenagers (transition from pediatric to adult) with a chronic condition. | Thirteen teenagers from the University of Vermont |
Participants agreed that overall, text messaging was the right channel for them, and the rate of one message per week was preferred. Avg. response for patients = 97%. | Participants suggest this chatbot should be expanded, and that it shows promise to help teenagers attain self-care skills on the transition journey. | |
| Bickmore et al., 2010 | America-Boston | Depressive Symptoms | Hospital patients know about their post-discharge self-care regimen through an automated system. | One hundred and thirty patients from Boston Medical Centre. |
All patients rated the agent with high satisfaction besides ease of use. Most patients (76%) preferred receiving discharge information from the agent instead of doctors or nurses. |
Patients with major depressive symptoms showed more desire to continue with the agent ( The following are the attitude measures towards the agent: satisfaction: usability: continue relationship: preference: adherence: | |
| Bickmore et al., 2010 | America-Pennsylvania | Schizophrenia | Promoting antipsychotic medication adherence for patients with schizophrenia. | Twenty patients from a mental health outpatient clinic. |
Sixteen participants out of the total completed the study of 1 month–daily use for ten min. The following are the participants’ ratings out of 5: trust: 4.4; liking: 4.3; satisfaction: 4.5; ease of use: 4.3; keep going with the system: 4.4. |
Self-reported medication and physical activity adherence through all measures were very high (84–89%). Relationship with the agent was significantly correlated with system use ( | |
| Bott et al., 2019 | America-New York | Loneliness, Depression, Delirium, Falls | Supporting nurses and mitigating risks of hospitalization for elders. | Ninety-five elders from an urban community hospital in New York. | The mean for patient engagement data was as follows: |
Delirium: intervention group: significant reduction ( Loneliness: the intervention group experienced a decrease in loneliness compared with the control group ( Depression: no significant difference between the two groups (intervention and control). Falls: the fall rate reduced by 82% in the intervention group while rate increased in the control group. | |
| Chaix et al., 2019 | France and Europe | Breast Cancer | Support, education, and improving medication adherence. | Analysis for the conversations between patients and chatbot (Vik). |
The satisfaction rate was very high, 93.95% (900/958). Vik chatbot helpful and supported by 88% (943/958). | Not reported | |
| Dworkin et al., 2018 | America- Chicago | HIV | Promoting HIV medication adherence and retention in care. | Sixteen men participated (African American men who have sex with men) recruited from four Universities of Illinois at Chicago. |
All participants welcomed positive messages and images, while some participants did not welcome the negative messages and images. Participants liked the interaction with the instructional avatar. The first four focus groups showed that stigma emerged as a critical issue, but there were no concerns by the fifth group. Avatar was acceptable by almost all participants, except four, who hoped for an option to choose a female version. |
The study revealed that stigma at different levels should be considered. Negative images can overwhelm participants and make them want to turn off the app and not return to it. | |
| Easton et al., 2019 | UK | Patients with an Exempla r Long-Term Condition (LTC; Chronic Pulmonary Obstructive Disease (COPD)) | Data collection, support, self-management, and diagnosis. | Ten patients were identified through the local British Lung Foundation Breathe Easy support group. |
Almost half participants strongly agreed to use this system frequently. Easy to use: 88%. Needing technical support: 50%. | Not reported | |
| Greer et al., 2019 | America | After Cancer Treatment | Support and follow-up | Forty-five young adults from Facebook advertising, survivorship organizations and direct email. |
The feedback of the chatbot is nonjudgmental. The chatbot was helpful: 64%. Recommend it to a friend: 69%. |
After 4 weeks, participants in the experimental group reported an average reduction in anxiety of 2.58 standardized t-score units. Mixed-effects models revealed a trend-level ( The experimental group also experienced greater reductions in anxiety when they engaged in more sessions ( There were no significant effects by group on changes in depression, positive emotion, or negative emotion. | |
| Hauser-Ulrich et al., 2019 | German and Swiss | Self-Management of Chronic Pain | Pain self-management | One hundred and two participants from the SELMA app. |
Participants mentioned the app was useful and easy to use. The avg. answer ratio of participants in the intervention group: 0.71. |
In relation to impairment and pain intensity, the intention to change behavior was positive ( Compared with the control group, The intervention group did not show a significant change in pain-related impairment ( | |
| Inkster et al., 2018 | America- Brooklyn and Chicago | Symptoms of Depression | Data collection and self-reported symptoms of depression | One hundred and twenty-nine users from the Wysa app (high users, |
Seventy-five users found the app favorable (82%). Thirteen users claimed that the app does not understand or repeat itself (14%). Seventy-five users commented that the app is not helpful (82%). The high users’ group had a highly significant improvement average ( | Not reported | |
| Lobo et al., 2017 | Portugal | Heart Failure Care and Pharmacological Information | Managing information about medicines and increasing adherence | Eleven native Portuguese adults. |
System naturalness, information quality, and coherence scores were consistent among participants. Participants need an initial time on the system to know how it worked. The majority of the participants faced difficulties with the speech recognition of some keywords. | CARMIE has proven the capability of addressing the pharmacological and treatment information for heart failure daily care. | |
| Neerincx et al., 2019 | Netherlands and Italy | Diabetes–Type 1 (T1DM) | Support and manage children diabetes | Children from diabetes camps and hospitals in Netherlands and Italy. |
Cycle1: Children have increased knowledge of T1DM. Children like the PAL actor (robot and its avatar). Children experience diabetes-related activities more positively. Cycle2: Children bond with the PAL actor via the robot and its avatar. Children are motivated to work on their personal objectives with PAL. >Cycle3: Children have increased situated knowledge on T1DM. Children are aware of the T1DM state and causes and develop self-efficacy. Children have a higher Quality of Life concerning T1DM. Children seamlessly follow the culture and hospital-dependent diabetes management processes. Children pursue relatively difficult goals. |
Children in the intervention groups had a stronger increase in self-care score ( No effect on diabetes related quality of life in children ( | |
| Rehman et al., 2020 | Korea | Glaucoma and Diabetic Conditions | Data collection and diagnosing | Thirty-three international students from the University of Kyung Hee. | Using Cronbach’s Alpha Coefficient correlation of items per scale: | Not reported | |
| Stephens et al., 2019 | America- Boston | Obesity and Prediabetes | Self-reported progress, support and follow-up with a clinician | Twenty-three youths with obesity symptoms from children’s healthcare system. | Ninety-six percent of the total patients reported this chatbot is helpful. | Not reported | |
| O’Hara et al., 2008 | America | Intellectual Disabilities; Poor Dental Hygiene | Education and self-management | Thirty-six participants from a single dental practice. No information about age and gender; 9 participants left study partway through; duration: 6 months. | More than half of participants reported PDAs not functioning correctly (mostly problems keeping the battery charged). | Ten participants (40%) achieved improvement in at least three areas of oral health. | |
| Philip et al., 2017 | France | Major Depressive Disorders (MDD) | Clinical interviews (179 participants with major depressive disorders; interview 1 with CA, interview 2 with sleep clinic psychiatrist). | One hundred and seventy-nine outpatients from a sleep clinic in Bordeaux University Hospital. |
Acceptability was good—25.4 (E-scale 0–30). Seventy-three percent of patients scored above 24. | Not reported | |
| Piau et al., 2019 | France | Cancer (Geriatric Oncology) | Data collection | Nine participants (undergoing chemotherapy after cancer diagnosis). |
Ninety-seven percent satisfaction on the chatbot overall. Eighty-seven percent considered monitoring Useful. Eighty percent satisfied with monitoring frequency. Most valuable benefits were moral support 44% and treatment management 40%. | Not reported | |
| Puskar et al., 2011 | America | Schizophrenia | Treatment, support and education. | Seventeen patients completed the study, but only results from two participants were mentioned in the study. |
Patient 1: Mr. Z. Found the tracking system easy and simple to use. Patient 5: Ms. Q. Noted that Laura asks too many questions, then became okay with this after more explanation. After the trial period, Ms. Q understood her illness and the importance of taking meds and felt Laura was “on her side”. In addition, the ability to choose the time of day gave her freedom. | Before CA, the participants had an adherence level of 21%, but with the CA, the rate rose to 46%. | |
| Richards and Caldwell, 2018 | Australia | Urinary Incontinence | Treatment and education | Children with urinary incontinence. |
Fifty-one (69.9%) parents and 45 (61.5%) of children were happy with the treatment. Satisfaction results—11 questions asked—all mean results between 2.37–3.36 on 5-point scale (1 = seldom, 5 = always). Usability results—nine questions asked—all mean results between 2.66–3.13 on a 4-point Likert scale (1 = strongly disagree, 4 = strongly agree, no neutral options). |
Fifty-four participants (74%) reported being dry during the day for 14 consecutive days, and 19 participants (26%) reported being dry during the night for 14 consecutive days. Thirty-seven percent of the total participants reported decreased severity of symptoms. | |
| Ryu et al., 2020 | South Korea | Mental Health (Depression and Anxiety) | Treatment | Initial testing had 24 older adults. |
Most users preferred text-based interaction. Users that experienced technological usability issues preferred voice interaction. Eighty-three percent of users were positive about cognition-enhancing games. |
The study reported a reduction in depression ( | |
| Schroeder et al., 2018 | America | Mental Health | Treatment and education | Seventy-three participants. |
Users trusted the chatbot as it was based on Dr Marsha Linehan’s material on DBT. Some felt the chatbot was hard to engage with, too generic and impersonal. The reminders by the app helped the participants stay engaged. Some participants felt their learning was hindered by their inability to ask questions. |
The study showed a significant reduction of depression (PHQ-9, linear and quadratic Participants saw a reduction in dysfunctional coping ( | |
| Sebastian & Richards, 2017 | Australia | Mental and Physical Health (Anorexia Nervosa) | Education and increased awareness | Two hundred and forty-five undergraduate university students. |
Participants improved their ability to recognize Anorexia Nervosa over time ( Participants less likely to recognize Anorexia Nervosa as another form of eating disorder over time ( |
Participants were found to have improved recognition of Anorexia Nervosa as a mental illness over time ( Both positive and negative volitional stigma was significantly lower in post-intervention 1 ( Education strategies for females reduce traditional stigma in females ( | |
| Shamekhi & Bickmore, 2018 | America | Various Chronic Conditions; Pain, Anxiety, and Depression | Treatment and coaching | Respiratory RCT had 21 participants. |
Eighty-six percent of participants found the agent was more accurate with respiratory sensors. Many participants preferred breathing over tapping a screen as it was less distracting. Most participants found the synthetic voice acceptable, some enjoyed it, and few found it robotic. Most participants found the appearance of the agent calming. Most participants became more aware of their breathing after agent feedback. |
The following shows the evaluation results: Instructor: Sensor ( Meditation experience: Sensor ( Interactivity: Sensor ( | |
| Tielman et al., 2017 | Netherla-nds | Post-Traumatic Stress Disorder (PTSD). | Treatment. | Four participants. |
Participants rated the system usability scale between 73 and 75. The participants found the system useful ( There was significant variance in the results (95%). One participant found the agent and the instructional video to be detrimental to therapy. |
On average, the questions provided helped the participants recall memory ( No difference found in system components ( Average usefulness of questions: SD = 8.37. Average usefulness of system: SD = 16.17. No difference in components: Usefulness SD = 6.12, usability SD = 14.96. | |
Abbreviations: Avg.: average; OOV: out of vocabulary; RCT: randomized control trial, app: application; p: p-value; Laura: chatbot prototype who blends speech recognition, AI, and realistic 3D animation; SELMA app: A Digital Coach for Self-Management of Pain; Wysa app: AI-powered mental health app; CARMIE: a smartphone-based assistant developed with the aim to deliver information and knowledge-based advice to help chronic disease patients; CA: conversational agents; ECA: Embodied Conversational Agent; PDA: Personal digital assistance; eADVICE: electronic Advice and Diagnosis via the Internet following Computerised Evaluation; Dr. Evie: eVirtual agent for incontinence and enuresis; PHQ-9: Patient Health Questionnaire 9-item scale, measures the frequency and severity of depressive symptoms; DBT: Dialectical Behavior Therapy. OASIS: Development and Validation of an Overall Anxiety Severity and Impairment Scale; SD: standard deviation.
Characteristics of the conversational agents evaluated in the included studies.
| Author, Year | Type of Communication Technology; Type of Conversational Agent | AI Methods Used | Dialogue Management | Dialogue Initiative | Input | Output | Task-Oriented |
|---|---|---|---|---|---|---|---|
| Azzini et al., 2003 | Smartphone and web-based; spoken dialog system. | Speech recognition and spoken dialog system. | Finite-state | Mixed | Spoken | Spoken, written | Yes |
| Baptista et al., 2020 | Smartphone app; ECA. | Speech recognition, natural language processing. | Finite-state | System | Spoken, visual | Spoken, written, visual | Yes |
| Beaudry et al., 2019 | Text messaging platform; chatbot. | Machine learning, NLU, NLP, deep learning, speech recognition. | Finite-state | System | Written | Written | Yes |
| Bickmore et al., 2010 | Framework; ECA. | Speech recognition, synthetic voice. | Finite-state | System | Spoken, visual | Spoken, written, visual | Yes |
| Bickmore et al., 2010 | Home desktop software; animated agent and interaction dialogues. | Not reported. | Finite-state | System | Visual | Spoken, visual | Yes |
| Bott et al., 2019 | Platform; ECA. | Text-to-speech, NLU. | Frame-based | Mixed | Spoken, visual | Spoken, written, visual | Yes |
| Chaix et al., 2019 | Smartphone and web-based; chatbot. | Machine learning, NLP. | Finite-state | System | Written, visual | Written | Yes |
| Dworkin et al., 2018 | Smartphone app; Avatar-based embodied agent. | Not reported. | Finite-state | Mixed | Spoken, written, visual | Spoken, written, visual | Yes |
| Easton et al., 2019 | Web-based; avatar and chatbot. | NLP, speech recognition. | Frame-based | Mixed | Spoken, written | Spoken, written, visual | Yes |
| Greer et al., 2019 | Facebook messenger; chatbot. | Not reported. | Finite-state | System | Written | Written, visual | Yes |
| Hauser-Ulrich et al., 2019 | Smartphone app; chatbot. | Not reported. | Finite-state | System | Written | Written, visual | No |
| Inkster et al., 2018 | Smartphone app; chatbot. | Machine learning, unsupervised learning. | Finite-state | System | Written | Written, visual | Yes |
| Lobo et al., 2017 | Android app; chatbot. | Speech recognition, speech synthesis, spoken natural language, hidden | Frame-based | Mixed | Spoken, written | Spoken, written | Yes |
| Neerincx et al., 2019 | Platform independent app, robot and avatar. | Machine learning, deep learning, speech recognition, speech synthesis. | Finite-state | System | Visual | Spoken, written, visual | Yes |
| Rehman et al., 2020 | Android app; chatbot. | NLU, speech recognition, text to speech synthesis, neural network algorithm, machine learning, natural language processing, deep learning, spoken dialog. | Frame-based | User | Spoken, written | Spoken, written | Yes |
| Stephens et al., 2019 | SMS text messaging; chatbot. | Not reported. | Frame-based | Mixed | Written | Written | Yes |
| O’Hara et al., 2008 | Personal Digital Assistants (PDAs). | Not reported. | Finite-state | System | Written | Written | Yes |
| Philip et al., 2017 | Home desktop software; Virtual | Speech recognition, | Finite-state | System | Spoken | Spoken | Yes |
| Piau et al., 2019 | Semi-automated smartphone | Speech to text. | Finite-state | System | Written | Written | Yes |
| Puskar et al., 2011 | Home desktop software; Relational Agent. | NLU, facial recognition, | Frame-based | System | Written | Written | Yes |
| Richards and Caldwell, 2018 | Website; Avatar and Empathic ECA a. | Speech to text. | Finite-state | System | Visual | Written; spoken | No |
| Ryu et al., 2020 | Smartphone app; chatbot. | Speech recognition. | Frame-based | System | Visual | Written | No |
| Schroeder et al., 2018 | Smartphone app; chatbot. | Not reported. | Finite-state | System | Visual | Written | Yes |
| Sebastian & Richards, 2017 | Platform independent app; ECA. | Not reported. | Finite-state | System | Visual | Written | Yes |
| Shamekhi & Bickmore, 2018 | Home desktop software; an animated agent with spoken dialogue and sensing. | Spoken dialog system. | Frame-based | System | Respiration sensor | Spoken | Yes |
| Tielman et al., 2017 | Home desktop software; an animated agent with spoken dialogue. | Spoken dialog system. | Finite-state | System | Visual | Spoken; written | Yes |
Abbreviations: app: application; ECA: Embodied Conversational Agent; a Empathic ECA: empathic agent that provides face-to-face conversation in an empathic and caring way, to act as a virtual doctor for the family to interact with.
Figure 1Flow Diagram.
Characterisation of conversational agents (Laranjo et al. 2018 [27]).
|
| Finite-state | The user is taken through a dialogue consisting of a sequence of pre-determined steps or states. |
| Frame-based | The user is asked questions that enable the system to fill slots in a template in order to perform a task. | |
| The dialogue flow is not pre-determined, but it depends on the content of the user’s input and the information that the system has to elicit. | ||
| Agent-based | These systems enable complex communication between the system, the user, and the application. There are many variants of agent-based systems, depending on what aspects of intelligent behavior are designed into the system. In agent-based systems, communication is viewed as the interaction between two agents, each of which is capable of reasoning its own actions and beliefs, and sometimes the actions and beliefs of the other agent. The dialogue model takes the preceding context into account, with the result that the dialogue evolves dynamically as a sequence of related steps that build on each other. | |
|
| User | The user leads the conversation. |
| System | The system leads the conversation. | |
| Mixed | Both the user and the system can lead the conversation. | |
|
| Spoken | The user uses spoken language to interact with the system. |
| Written | The user uses written language to interact with the system. | |
|
| Spoken, Written, visual (e.g., non-verbal communication like facial expressions or body movements). | |
|
| Yes | The system is designed for a particular task and is set up to have short conversations, in order to get the necessary information to achieve the goal (e.g., booking a consultation). |
| No | The system is not directed to the short-term achievement of a specific end-goal or task (e.g., purely conversational chatbots). | |