| Literature DB >> 35969431 |
Corina R Ronneberg1, Nancy E Wittels1, Olu A Ajilore1, Jun Ma1, Thomas Kannampallil2, Vikas Kumar1, Nan Lv1, Joshua M Smyth3, Ben S Gerber4, Emily A Kringle1, Jillian A Johnson3, Philip Yu1, Lesley E Steinman5.
Abstract
BACKGROUND: Artificial intelligence has provided new opportunities for human interactions with technology for the practice of medicine. Among the recent artificial intelligence innovations, personal voice assistants have been broadly adopted. This highlights their potential for health care-related applications such as behavioral counseling to promote healthy lifestyle habits and emotional well-being. However, the use of voice-based applications for behavioral therapy has not been previously evaluated.Entities:
Keywords: artificial intelligence; behavioral therapy; mental health; problem-solving therapy; user evaluation; voice assistants
Year: 2022 PMID: 35969431 PMCID: PMC9419044 DOI: 10.2196/38092
Source DB: PubMed Journal: JMIR Form Res ISSN: 2561-326X
Figure 1User interaction with Lumen for problem-solving treatment (PST) sessions highlighting the various components. AWS: Amazon Web Services; EMA: ecological momentary assessment.
Baseline characteristics by prior problem-solving treatment (PST) experience.
| Characteristic | All Lumen formative evaluation participants (N=26) | Participants with prior PST experience (n=17) | Participants without prior PST experience (n=9) | ||
| Age (years), mean (SD) | 43.9 (11.9) | 42.6 (13.2) | 46.3 (9.2) | .46 | |
| Female, n (%) | 20 (77) | 11 (65) | 9 (100) | .04 | |
|
| |||||
|
| Non-Hispanic White | 4 (15) | 3 (18) | 1 (11) | .34 |
|
| African American | 13 (50) | 9 (53) | 4 (44) | .34 |
|
| Asian or Pacific Islander | 1 (4) | 1 (6) | 0 (0) | .34 |
|
| Hispanic | 6 (23) | 2 (12) | 4 (44) | .34 |
|
| Other (eg, decline to state or multirace) | 2 (8) | 2 (12) | 0 (0) | .34 |
|
| |||||
|
| High school or general education or less | 2 (8) | 1 (6) | 1 (11) | .95 |
|
| College—1 year to 3 years | 8 (31) | 5 (29) | 3 (33) | .95 |
|
| College—≥4 years | 10 (38) | 7 (41) | 3 (33) | .95 |
|
| Post college | 6 (23) | 4 (23) | 2 (22) | .95 |
|
| |||||
|
| <35,000 | 7 (27) | 4 (23) | 3 (33) | .32 |
|
| 35,000 to <55,000 | 7 (27) | 3 (18) | 4 (44) | .32 |
|
| 55,000 to <75,000 | 5 (19) | 4 (23) | 1 (11) | .32 |
|
| ≥75,000 | 7 (27) | 6 (35) | 1 (11) | .32 |
Paired t test results comparing NASA Task Load Index scores between sessions 1 and 2.
| Question | Session 1 (n=26), mean (SD) | Session 2, (n=23), mean (SD) | ||
| How mentally demanding was the task? ( | 42.7 (25.0) | 53.9 (26.1) | −1.80 (22) | .09 |
| How hurried or rushed were you in the pace of the task? ( | 36.5 (23.2) | 52.0 (29.1) | −2.37 (22) | .03 |
| How hard did you have to work to accomplish your level of performance? ( | 36.0 (23.4) | 42.8 (18.9) | −1.44 (22) | .16 |
| How insecure, discouraged, irritated, stressed, and annoyed were you? ( | 31.9 (22.0) | 38.5 (24.6) | −0.95 (22) | .35 |
| How successful were you in accomplishing what you were asked to do? ( | 34.6 (23.1) | 37.2 (23.3) | −0.37 (22) | .71 |
aItalicized text shows the various categories of the NASA Task Load Index scales.
Paired t test results comparing task, goal, and bond subscales of the Working Alliance Inventory–Technology Version between sessions 1 and 2.
| Scale | Session 1, mean (SD) | Session 2, mean (SD) | ||
| Task subscale | 5.2 (0.9) | 5.3 (0.9) | 0.11 (22) | .92 |
| Bond subscale | 4.9 (1.0) | 4.7 (1.0) | 1.49 (22) | .15 |
| Goal subscale | 5.0 (0.9) | 5.1 (0.9) | −0.32 (22) | .75 |
| Overall scale | 5.0 (0.9) | 5.0 (0.9) | 0.56 (22) | .58 |
Coding categories, their description, and examples from the interviews.
| Coding category (spreada, %) | Description | Example from data |
| Interactive task load (78%) | Participant description of the demands of interacting with Lumen. Includes: Temporal load (pace of interactions, whether there was ample time to provide a response) Cognitive load (density of content and length of sessions) |
“I felt kind of rushed when it was like time to, like, think through and write things” (3502) [Temporal load] “Sometimes it’s telling you a lot of things. So, for a user, it’s hard...You’re not looking at somebody. So, you’re really, really having to concentrate and pay attention, so if by any chance you miss something, then you kind of get lost” (1213) [Cognitive load] |
| Natural language understanding (46%) | Participant description of challenges that Lumen faced with understanding participants’ verbal responses. Includes: Spoken comprehension (breakdowns due to Lumen’s comprehension) Accent or enunciation issues (eg, understanding names) |
“I think it was difficult to provide the prompts that were requested, and I suspect that depending on the person’s accent or if they’re from—if maybe their English isn’t exactly clear, there may be some language issues” (5457) [Spoken comprehension and accent or enunciation issues] |
| Comparison with human coach (100%) | Comparison of Lumen to a human coach. Includes: Naturalness of voice or tone (presence or absence of emotion) Interactive engagement in conversation (whether Lumen was conversational) Lumen’s tone or inflection (identifying when Lumen was asking a question vs making a statement) Lumen vs human PSTb content (comparing depth of help Lumen provided relative to human in delivery of PST) Perceived Lumen benefits or drawbacks (pros and cons of receiving PST from Lumen relative to human, eg, accessibility, availability, and comfort with disclosure) |
“...just robotic. Like, I’m talking to like a machine robot. That’s my initial thought. But at the same time, not in the way that it’s like dumb, but in that it’s like very scientific and not very like human.” (6132) (Naturalness of voice or tone) “I think initially for me, what may be missing that I picked up on right away is the human interaction component. [...] a human as opposed to talking to like a device or a computer [...] So, I don’t know how differently it'll be the more I become engaged with it.” (3498) [Interactive engagement in conversation] “When I spoke with [the human coach], I found myself venting, if I may, and going in every which direction, whereas Lumen forces me to stay very rigid, and sometimes when going through problem solving, the emotional release of going in every which direction, direction, rather than going straight and narrow feels a lot more comfortable.” (3831) [Lumen vs human PST] “it allows accessibility to people who can’t travel or maybe they feel anxious around talking to another person. So, it eliminates like class, it eliminates race, it eliminates sex. It eliminates sort of those prejudice that could happen in like a person-to-person to person setting.” (6132) [Perceived Lumen benefits] |
| PST features in Lumen (78%) | Description of the PST features as delivered by Lumen. Includes: Program structure or format (feedback around the stepwise PST process) Virtual PST coaching (describing Lumen’s role in the PST process) |
“You know, I think if I’m if I am if my goal is truly trying and I have a problem, I just feel overwhelmed. I don’t know how to attack it. Well Lumen supplies that. It breaks it down. It pulls all of the jumbled information out of my head, leaves the emotion behind and helps me lay out a plan for essentially attacking the problem without the emotional stress of it.” (3831) [Program structure or format and virtual PST coaching] |
| User recommendations (62%) | Participants’ recommendations for: Lumen improvements (ideas for functions or features in the user interface) Interacting with Lumen (tips for others to have an effective session with Lumen) |
“I would tell them that like, so like you’re talking to a computerized app, so make sure you’re speaking clearly and slowly and like follow directions in order to get what you’re what you need from it.” (6132) [Interacting with Lumen] “I would say as a part of the app, have basically have the binder already inside the app and then maybe have a link to a principal PDF for those who want to do that.” (6023) [Lumen improvements] “I think it would be kind of cool, especially with it being linked with Alexa is if it had the ability to pick up keywords. So, like if I, you know, saying like I need to work on my diet or trainer or whatever, that somehow it was able to tap into some of those keywords. And while it’s talking back to me saying, you know. You know, we’ve looked into like some trainings in your area. We are going to send you emails of, you know, something like that that would be like really great or hear from information regarding blah, blah, blah, blah, blah.” (3498) [Lumen improvements] “She could be better if she if I could see it, even though is a mechanical thing or robot, I want to see Lumen, so I know how Lumen it looks...I’d rather see the person I’m talking to, even though [it] is a machine or whatever it is I would rather see, you know.” (7323) [Lumen improvements] |
| Technical issues (36%) | Technical issues that were experienced by participants during the sessions. Includes: Breakdowns in conversation |
“Well, I was a little confused when it just stopped. It was still on the app. [...] And then it just completely shut the app.” (3470) [Breakdowns in conversation] |
aSpread refers to the percentage of transcripts (total=50) that the coding category was present.
bPST: problem-solving treatment.