| Literature DB >> 32985999 |
Ying-Chieh Liu1,2, Chien-Hung Chen3, Yu-Sheng Lin2, Hsin-Yun Chen4, Denisa Irianti1, Ting-Ni Jen5, Jou-Yin Yeh6, Sherry Yueh-Hsia Chiu7,8.
Abstract
BACKGROUND: Advances in voice technology have raised new possibilities for apps related to daily health maintenance. However, the usability of such technologies for older users remains unclear and requires further investigation.Entities:
Keywords: automatic speech recognition; elderly; food report; mHealth; randomized trial; usability evaluation; voice-added design
Mesh:
Year: 2020 PMID: 32985999 PMCID: PMC7551114 DOI: 10.2196/20317
Source DB: PubMed Journal: JMIR Mhealth Uhealth ISSN: 2291-5222 Impact factor: 4.773
Figure 1Voice-only reporting operation of a dish with a single ingredient, using steamed rice as an example.
Figure 2Voice-only reporting operation of a dish with two or three ingredients, using stir-fried broccoli with carrot as an example.
Figure 3Voice-button reporting operation of a dish with a single ingredient, using steamed rice as an example.
Figure 4Voice-button reporting operation for a dish with two or more ingredients, using stir-fried broccoli with carrot as an example.
Figure 5App evaluation flow using a randomized design. SUS: system usability scale.
Participant characteristics in the voice-only reporting and voice-button reporting groups.
| Variables | Total (N=57), n (%) or mean (SD) | Voice-only reporting group (n=30), n (%) | Voice-button reporting group (n=27), n (%) | ||||||
|
|
|
|
| .39 | |||||
|
| Male | 12 (21%) | 5 (17%) | 7 (26%) |
| ||||
|
| Female | 45 (79%) | 25 (83%) | 20 (74%) |
| ||||
|
|
|
|
| .15 | |||||
|
| ≤64 | 7 (12%) | 6 (20%) | 1 (4%) |
| ||||
|
| 65-74 | 23 (40%) | 10 (33%) | 13 (48%) |
| ||||
|
| ≥75 | 27 (48%) | 14 (47%) | 13 (48%) |
| ||||
| BMI (kg/m2)a | 22.55 (2.25) | 22.63 (2.37) | 22.45 (2.15) | .77 | |||||
|
|
|
|
| >.99 | |||||
|
| Junior high school | 12 (21%) | 6 (20%) | 6 (22%) |
| ||||
|
| Senior high/vocational school | 11 (19%) | 6 (20%) | 5 (19%) |
| ||||
|
| Bachelor’s degree | 28 (49%) | 14 (47%) | 14 (52%) |
| ||||
|
| Master’s degree | 5 (9%) | 3 (10%) | 2 (7%) |
| ||||
|
| Others | 1 (2%) | 1 (3%) | 0 (0%) |
| ||||
|
|
|
|
| .40 | |||||
|
| Yes | 18 (32%) | 8 (27%) | 10 (37%) |
| ||||
|
| No | 39 (68%) | 22 (73%) | 17 (63%) |
| ||||
|
|
|
|
| .58 | |||||
|
| Yes | 17 (30%) | 8 (27%) | 9 (33%) |
| ||||
|
| No | 40 (70%) | 22 (73%) | 18 (67%) |
| ||||
|
|
|
|
| .60 | |||||
|
| Yes | 54 (95%) | 29 (97%) | 25 (93%) |
| ||||
|
| No | 3 (5%) | 1 (3%) | 2 (7%) |
| ||||
|
|
|
|
| >.99 | |||||
|
| Yes | 4 (7%) | 2 (7%) | 2 (7%) |
| ||||
|
| No | 53 (93%) | 28 (93%) | 25 (93%) |
| ||||
|
|
|
|
| .38 | |||||
|
| Yes | 39 (68%) | 19 (63%) | 20 (74%) |
| ||||
|
| No | 18 (32%) | 11 (37%) | 7 (26%) |
| ||||
aAge and BMI data were analyzed with analysis of variance.
Overall accuracy comparison of error types in the voice-only reporting and voice-button reporting groups.
| Error type (correct/incorrect)a | Total (N=57), n (%) | Voice-only reporting group (n=30), n (%) | Voice-button reporting group (n=27), n (%) | |||
|
|
|
|
|
| ||
|
| Correct | 796 (99.7%) | 418 (99.5%) | 378 (100.0%) | .50 | |
|
| Incorrect | 2 (0.3%) | 2 (0.5%) | 0 (0.0%) |
| |
|
|
|
|
|
| ||
|
| Correct | 792 (99.2%) | 416 (99.0%) | 376 (99.5%) | .69 | |
|
| Incorrect | 6 (0.8%) | 4 (1.0%) | 2 (0.5%) |
| |
|
|
|
|
|
| ||
|
| Correct | 743 (93.1%) | 392 (93.3%) | 351 (92.9%) | .79 | |
|
| Incorrect | 55 (6.9%) | 28 (6.7%) | 27 (7.1%) |
| |
|
|
|
|
|
| ||
|
| Correct | 766 (96.0%) | 416 (99.0%) | 350 (92.6%) | <.001 | |
|
| Incorrect | 32 (4.0%) | 4 (1.0%) | 28 (7.4%) |
| |
|
|
|
|
|
| ||
|
| Correct | 794 (99.5%) | 416 (99.0%) | 378 (100.0%) | .13 | |
|
| Incorrect | 4 (0.5%) | 4 (1.0%) | 0 (0.0%) |
| |
|
|
|
|
|
| ||
|
| Correct | 771 (96.6%) | 404 (96.2%) | 367 (97.1%) | .48 | |
|
| Incorrect | 27 (3.4%) | 16 (3.8%) | 11 (2.9%) |
| |
|
|
|
|
|
| ||
|
| Correct | 759 (95.1%) | 420 (100.0%) | 339 (89.7%) | <.001 | |
|
| Incorrect | 39 (4.9%) | 0 (0.0%) | 39 (10.3%) |
| |
|
|
|
|
|
| ||
|
| Correct | 775 (97.1%) | 409 (97.4%) | 366 (96.8%) | .64 | |
|
| Incorrect | 23 (2.9%) | 11 (2.6%) | 12 (3.2%) |
| |
aThree items in beverage were not counted as no error types were found. Fourteen out of the 17 food items were included.
b#1 Missing first food name/syllable(s): After verbal reporting, the presented answer list did not include the first food name or the first syllable(s) of the food names.
c#2 Missing last food name/syllable(s): After verbal reporting, the presented answer list did not include the last food name or the last syllable(s) of the food names after voice reporting.
d#3 No desirable choices: After verbal reporting, the presented answer list did not present the desired food name or cooking method.
e#4 Missing cooking method(s): After verbal reporting, the presented answer list did not include the desired cooking method(s).
f#5 Repeated pronunciations: The presented answer list showed repeated pronunciations of food names and/or food attributes after voice reporting.
g#6 Incorrect selections in the list: Participant had trouble accurately tapping the desired choice (click interaction), leading to incorrect selection in the answer list.
h#7 Did not select the ‘mix’ button: Trouble before dish completion (click interaction). The user did not tap the “mix” button to complete dishes with two or three ingredients.
i#8 Incorrect operations: Incorrect operation procedure.
Accuracy comparison of each food item in the voice-only reporting and voice-button reporting groups.
| Food item and error type | Total (N=57), n (%) | Voice-only reporting group (n=30), n (%) | Voice-button reporting group (n=27), n (%) | ||
|
|
|
|
| ||
|
|
|
|
|
| |
|
|
| #1a | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #2b | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #3c | 1 (2%) | 0 (0%) | 1 (4%) |
|
|
| #4d | 3 (5%) | 1 (3%) | 2 (7%) |
|
|
| #5e | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #6f | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #7g | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #8h | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
|
|
|
| |
|
|
| #1 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #2 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #3 | 3 (5%) | 2 (7%) | 1 (4%) |
|
|
| #4 | 6 (11%) | 0 (0%) | 6 (22%) |
|
|
| #5 | 1 (2%) | 1 (3%) | 0 (0%) |
|
|
| #6 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #7 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #8 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
|
|
|
| |
|
|
| #1 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #2 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #3 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #4 | 2 (4%) | 0 (0%) | 2 (7%) |
|
|
| #5 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #6 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #7 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #8 | 1 (2%) | 0 (0%) | 1 (4%) |
|
|
|
|
| ||
|
|
|
|
|
| |
|
|
| #1 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #2 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #3 | 2 (4%) | 1 (3%) | 1 (4%) |
|
|
| #4 | 3 (5%) | 1 (3%) | 2 (7%) |
|
|
| #5 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #6 | 2 (4%) | 1 (3%) | 1 (4%) |
|
|
| #7 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #8 | 2 (4%) | 1 (3%) | 1 (4%) |
|
|
|
|
|
| |
|
|
| #1 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #2 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #3 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #4 | 2 (4%) | 0 (0%) | 2 (7%) |
|
|
| #5 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #6 | 3 (5%) | 2 (7%) | 1 (4%) |
|
|
| #7 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #8 | 1 (2%) | 0 (0%) | 1 (4%) |
|
|
|
|
|
| |
|
|
| #1 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #2 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #3 | 2 (4%) | 0 (0%) | 2 (7%) |
|
|
| #4 | 2 (4%) | 0 (0%) | 2 (7%) |
|
|
| #5 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #6 | 3 (5%) | 0 (0%) | 3 (11%) |
|
|
| #7 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #8 | 1 (2%) | 0 (0%) | 1 (4%) |
|
|
|
|
|
| |
|
|
| #1 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #2 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #3 | 10 (18%) | 6 (20%) | 4 (15%) |
|
|
| #4 | 2 (4%) | 0 (0%) | 2 (7%) |
|
|
| #5 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #6 | 1 (2%) | 1 (3%) | 0 (0%) |
|
|
| #7 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #8 | 1 (2%) | 1 (3%) | 0 (0%) |
|
|
|
|
| ||
|
|
|
|
|
| |
|
|
| #1 | 1 (2%) | 1 (3%) | 0 (0%) |
|
|
| #2 | 1 (2%) | 1 (3%) | 0 (0%) |
|
|
| #3 | 19 (33%) | 12 (40%) | 7 (26%) |
|
|
| #4 | 3 (5%) | 0 (0%) | 3 (11%) |
|
|
| #5 | 2 (4%) | 2 (7%) | 0 (0%) |
|
|
| #6 | 12 (21%) | 10 (33%) | 2 (7%) |
|
|
| #7 | 14 (25%) | 0 (0%) | 14 (52%) |
|
|
| #8 | 6 (11%) | 2 (7%) | 4 (15%) |
|
|
|
|
|
| |
|
|
| #1 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #2 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #3 | 1 (2%) | 0 (0%) | 1 (4%) |
|
|
| #4 | 1 (2%) | 0 (0%) | 1 (4%) |
|
|
| #5 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #6 | 2 (4%) | 0 (0%) | 2 (7%) |
|
|
| #7 | 3 (5%) | 0 (0%) | 3 (11%) |
|
|
| #8 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
|
|
|
| |
|
|
| #1 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #2 | 1 (2%) | 1 (3%) | 0 (0%) |
|
|
| #3 | 2 (4%) | 1 (3%) | 1 (4%) |
|
|
| #4 | 1 (2%) | 0 (0%) | 1 (4%) |
|
|
| #5 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #6 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #7 | 7 (12%) | 0 (0%) | 7 (26%) |
|
|
| #8 | 1 (2%) | 1 (3%) | 0 (0%) |
|
|
|
|
|
| |
|
|
| #1 | 1 (2%) | 1 (3%) | 0 (0%) |
|
|
| #2 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #3 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #4 | 1 (2%) | 0 (0%) | 1 (4%) |
|
|
| #5 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #6 | 1 (2%) | 0 (0%) | 1 (4%) |
|
|
| #7 | 5 (9%) | 0 (0%) | 5 (19%) |
|
|
| #8 | 2 (4%) | 0 (0%) | 2 (7%) |
|
|
|
|
|
| |
|
|
| #1 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #2 | 1 (2%) | 0 (0%) | 1 (4%) |
|
|
| #3 | 7 (12%) | 2 (7%) | 5 (19%) |
|
|
| #4 | 1 (2%) | 0 (0%) | 1 (4%) |
|
|
| #5 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #6 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #7 | 3 (5%) | 0 (0%) | 3 (11%) |
|
|
| #8 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
|
|
| ||
|
|
|
|
|
| |
|
|
| #1 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #2 | 1 (2%) | 0 (0%) | 1 (4%) |
|
|
| #3 | 5 (9%) | 2 (7%) | 3 (11%) |
|
|
| #4 | 4 (7%) | 2 (7%) | 2 (7%) |
|
|
| #5 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #6 | 2 (4%) | 2 (7%) | 0 (0%) |
|
|
| #7 | 5 (9%) | 0 (0%) | 5 (19%) |
|
|
| #8 | 5 (9%) | 4 (13%) | 1 (4%) |
|
|
|
|
|
| |
|
|
| #1 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #2 | 2 (4%) | 2 (7%) | 0 (0%) |
|
|
| #3 | 3 (5%) | 2 (7%) | 1 (4%) |
|
|
| #4 | 1 (2%) | 0 (0%) | 1 (4%) |
|
|
| #5 | 1 (2%) | 1 (3%) | 0 (0%) |
|
|
| #6 | 1 (2%) | 0 (0%) | 1 (4%) |
|
|
| #7 | 2 (4%) | 0 (0%) | 2 (7%) |
|
|
| #8 | 3 (5%) | 2 (7%) | 1 (4%) |
|
|
|
|
| ||
|
|
|
|
|
| |
|
|
| #1 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #2 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #3 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #4 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #5 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
|
|
|
| |
|
|
| #1 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #2 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #3 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #4 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #5 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
|
|
|
| |
|
|
| #1 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #2 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #3 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #4 | 0 (0%) | 0 (0%) | 0 (0%) |
|
|
| #5 | 0 (0%) | 0 (0%) | 0 (0%) |
a#1 Missing first food name/syllable(s): After verbal reporting, the presented answer list did not include the first food name or the first syllable(s) of the food names.
b#2 Missing last food name/syllable(s): After verbal reporting, the presented answer list did not include the last food name or the last syllable(s) of the food names after voice reporting.
c#3 No desirable choices: After verbal reporting, the presented answer list did not present the desired food name or cooking method.
d#4 Missing cooking method(s): After verbal reporting, the presented answer list did not include the desired cooking method(s).
e#5 Repeated pronunciations: The presented answer list showed repeated pronunciations of food names and/or food attributes after voice reporting.
f#6 Incorrect selections in the list: Participant had trouble accurately tapping the desired choice (click interaction), leading to incorrect selection in the answer list.
g#7 Did not select the ‘mix’ button: Trouble before dish completion (click interaction). The user did not tap the “mix” button to complete dishes with two or three ingredients.
h#8 Incorrect operations: Incorrect operation procedure.
Reporting time in the voice-only reporting and voice-button reporting groups.
| Food item | Reporting time (s) | ||||
| Total (N=57), mean (SD) | Voice-only reporting group (n=30), mean (SD) | Voice-button reporting group (n=27), mean (SD) | |||
|
|
|
|
|
| |
|
| Boiled rice porridge | 20.44 (16.20) | 10.50 (4.57) | 31.49 (17.36) | <.001 |
|
| Steamed rice | 14.37 (8.19) | 10.11 (4.98) | 19.11 (8.53) | <.001 |
|
| Stir-fried noodle | 14.20 (10.75) | 8.67 (3.45) | 20.35 (12.69) | <.001 |
|
|
|
|
|
| |
|
| Grilled pork sausage | 26.39 (38.58) | 12.20 (6.70) | 42.15 (51.63) | .006 |
|
| Deep-fried chicken egg | 16.54 (10.47) | 11.46 (8.82) | 22.17 (9.32) | <.001 |
|
| Fried chicken leg | 16.80 (13.53) | 8.99 (2.97) | 25.48 (15.36) | <.001 |
|
| Pan-fried mackerel | 20.24 (12.69) | 15.01 (11.23) | 26.06 (11.81) | <.001 |
|
|
|
|
|
| |
|
| Stewed wheat gluten with peanuts | 51.73 (32.09) | 42.38 (28.05) | 62.11 (33.58) | .02 |
|
| Stir-fried broccoli with carrot | 36.20 (36.30) | 12.68 (4.17) | 62.34 (38.34) | <.001 |
|
| Stir-fried tofu with green bean | 30.98 (27.86) | 10.80 (4.90) | 53.41 (25.56) | <.001 |
|
| Stir-fried chicken egg with tomato | 32.32 (28.73) | 12.32 (6.32) | 54.55 (27.54) | <.001 |
|
| Stir-fried bitter melon with salted duck egg | 33.90 (32.36) | 12.39 (5.11) | 57.80 (33.16) | <.001 |
|
|
|
|
|
| |
|
| Stir-fried cabbage with bacon and black fungus | 44.39 (35.08) | 21.23 (19.68) | 70.13 (30.20) | <.001 |
|
| Stir-fried dry bean curd with bell pepper and carrot | 41.81 (33.76) | 16.62 (8.42) | 69.80 (28.82) | <.001 |
|
|
|
|
|
| |
|
| Soymilk | 10.82 (7.20) | 9.91 (6.31) | 11.86 (8.11) | .31 |
|
| Green tea | 8.76 (2.66) | 8.86 (3.07) | 8.66 (2.16) | .78 |
|
| Milk tea | 9.64 (6.26) | 10.81 (8.35) | 8.35 (1.84) | .13 |
System usability scale and subjective perception in the voice-only reporting and voice-button reporting groups.
| Scorea,b,c | Voice-only reporting group (n=30), mean (SD) | Voice-button reporting group (n=27), mean (SD) | |
| Overall score | 83.80 (9.49) | 80.44 (10.25) | .20 |
| Usability score | 83.58 (9.57) | 81.57 (9.69) | .43 |
| Learnability score | 84.67 (14.56) | 75.93 (20.24) | .06 |
aQuestionnaires were presented in Chinese.
bThe mean score of the system usability scale with adjective ratings were as follows: 35.7 (“poor”), 50.9 (“ok”), 71.4 (“good”), and 85.5 (“excellent”).
cThe questionnaire’s Cronbach α for voice-only reporting (α=.77) and voice-button reporting (α=.78) exceeded .70, indicating good internal consistency and reliability.