| Literature DB >> 36062266 |
Mels de Kloet1, Shengyun Yang1.
Abstract
Voice intelligence is a revolutionary "zero-touch" type of human-machine interaction based on spoken language. There has been a recent increase in the number and variations of voice assistants and applications that help users to acquire information. The increased popularity of voice intelligence, however, has not been reflected in the customer value chain. Current research on the socio-technological aspects of human-technology interaction has emphasized the importance of anthropomorphism and user identification in the adoption of the technology. Prior research has also pointed out that user perception toward the technology is key to its adoption. Therefore, this research examines how anthropomorphism and multimodal biometric authentication influence the adoption of voice intelligence through user perception in the customer value chain. In this study we conducted a between-subjects online experiment. We designed a 2 × 2 factorial experiment by manipulating anthropomorphism and multimodal biometric authentication into four conditions, namely with and without a combination of these two factors. Subjects were recruited from Amazon MTurk platform and randomly assigned to one of the four conditions. The results drawn from the empirical study showed a significant direct positive effect of anthropomorphism and multimodal biometric authentication on user adoption of voice intelligence in the customer value chain. Moreover, the effect of anthropomorphism is partially mediated by users' perceived ease of use, perceived usefulness, and perceived security risk. This research contributes to the existing literature on human-computer interaction and voice intelligence by empirically testing the simultaneous impact of anthropomorphism and biometric authentication on users' experience of the technology. The study also provides practitioners who wish to adopt voice intelligence in the commercial environment with insights into the user interface design.Entities:
Keywords: anthropomorphism; artificial intelligence; customer value chain; multimodal biometric authentication; user perception; voice intelligence
Year: 2022 PMID: 36062266 PMCID: PMC9428311 DOI: 10.3389/frai.2022.831046
Source DB: PubMed Journal: Front Artif Intell ISSN: 2624-8212
Figure 1Research model.
Components of voice intelligence (Pieraccini, 2012; Hirschberg and Manning, 2015; de Barcelos Silva et al., 2020; Amazon, 2021).
|
|
|
|---|---|
| Automated Speech Recognition (ASR) | Coverts the given input into understandable commands by matching words with patterns in sound peaks. |
| Natural Language Processing (NLP) | Semantically structures the linguistic utterance using computational techniques. |
| Dialog Management (DM) | Decides which action should be performed according to previous interactions' dialog strategy and experiences. |
| Response Generator (RG) | Produces the output text and synthesizes it to voice. |
Technical modules of the biometric authentication process (Liu et al., 2017; Mahfouz et al., 2017).
|
|
|
|
|---|---|---|
| 1 | Sensor | Measures or records the unprocessed biometric data of the user. |
| 2 | Biometric extractor | Cleans the raw data by detecting and removing oddities to improve the data quality. |
| 3 | Biometric matcher | Compares the input features with the data template to generate a matching score. |
2 × 2 Factorial design.
|
|
| ||
|---|---|---|---|
|
|
| ||
| Anthropomorphism (ANT) | With | T4: ANT | MBA | T3: ANT | NMBA |
| Without | T2: NANT | MBA | T1: NANT | NMBA | |
Overview of the subjects.
|
|
|
|
|---|---|---|
| Gender | ||
| Male | 166 | 69.2 |
| Female | 73 | 30.4 |
| Non-binary | 1 | 0.4 |
| Age–groups | ||
| 18–24 | 12 | 5 |
| 25–34 | 139 | 57.9 |
| 35–44 | 60 | 25 |
| 45–54 | 20 | 8.3 |
| 55–64 | 9 | 3.8 |
| Educational background | ||
| High-School graduate | 13 | 5.4 |
| Attended college | 14 | 5.8 |
| Bachelor's degree | 145 | 60.4 |
| Master's degree | 68 | 28.3 |
| Continent | ||
| Africa | 2 | 0.8 |
| Asia | 29 | 12.1 |
| Europe | 5 | 2.1 |
| North America | 198 | 82.5 |
| South America | 6 | 2.5 |
| Total | 240 | 100 |
Assessment of measurement model.
|
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|---|
| PEU | 5 | 0.74 | 0.93 | 0.67 |
| ||||
| PU | 5 | 0.72 | 0.90 | 0.63 |
|
| |||
| PPC | 5 | 0.73 | 0.91 | 0.67 |
|
|
| ||
| PSR | 5 | 0.62 | 0.83 | 0.53 |
|
|
|
| |
| AVI | 5 | 0.74 | 0.92 | 0.63 |
|
|
|
|
|
PEU, perceived ease of use; PU, perceived usefulness; PPC, perceived privacy concerns; PSR, perceived security risks; AVI, adoption of voice intelligence. Diagonal elements (bold) are the square root of the AVE for each construct; Off-diagonal factors correspond to construct intercorrelations. The italics values are used to determine the discriminant validity. All values exceed the corresponding intercorrelation.
Descriptive statistics.
|
|
|
|
| |
|---|---|---|---|---|
| PEU | 5.66 | 0.94 | 2.20 | 7.00 |
| PU | 5.53 | 0.93 | 2.20 | 7.00 |
| PSR | 5.33 | 1.01 | 1.60 | 7.00 |
| PPC | 5.19 | 1.17 | 1.00 | 7.00 |
| AVI | 5.48 | 0.97 | 2.00 | 7.00 |
Descriptive statistics per treatment.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| T1 ( | 5.33 (1.16) | 5.30 (1.15) | 5.18 (1.22) | 5.11 (1.18) | 5.23 (1.18) |
| T2 ( | 5.80 (0.89) | 5.65 (0.76) | 5.52 (0.79) | 5.22 (1.04) | 5.62 (0.97) |
| T3 ( | 5.59 (0.84) | 5.39 (0.88) | 5.12 (0.78) | 5.10 (1.01) | 5.40 (0.88) |
| T4 ( | 5.92 (0.73) | 5.79 (0.80) | 5.50 (1.11) | 5.36 (1.40) | 5.66 (0.76) |
T1, without anthropomorphic and multimodal biometric treatment; T2, without anthropomorphic but with multimodal biometric treatment; T3, with anthropomorphic but without multimodal biometric treatment; T4, with anthropomorphic and multimodal biometric treatment.
Factorial ANOVA analysis of direct effects.
|
|
|
|
|
| |
|---|---|---|---|---|---|
| MBA | 1 | 13.824 | 14.935 | 0.06 | [0.02; 0.11] |
| ANT | 1 | 3.851 | 4.160 | 0.02 | [0.00; 0.05] |
| ANT | MBA | 1 | 2.817 | 3.043 | 0.01 | [0.00; 0.05] |
| Residuals | 236 | 0.926 |
p < 0.05,
,
p < 0.001.
Path analyses.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
|
| |||||
| ANT → PEU | 0.6733333 | 0.2865799 | 2.3495484 | 1.2331072 | 0.1045226 |
| ANT → PU | 0.4000000 | 0.1545632 | 2.5879387 | 0.6966261 | 0.0896086 |
| ANT → PSR | −0.7733333 | 0.3559318 | −2.1727007 | −1.4400280 | −0.0584524 |
| ANT → PPC | −0.5199999 | 0.4442834 | −1.170424 | −1.3440642 | 0.4131769 |
| MBA → PEU | −0.2533333 | 0.2968869 | −0.8532991 | −0.8383091 | 0.3273333 |
| MBA → PU | −0.2933333 | 0.2860157 | −1.0255845 | −0.8418642 | 0.2946706 |
| MBA → PSR | −0.0333333 | 0.3588537 | −0.0928884 | −0.6425856 | 0.7668297 |
| MBA → PPC | −0.2800000 | 0.4486493 | −0.6240955 | −1.1246750 | 0.6332600 |
| PEU → AVI | 0.6490008 | 0.0680327 | 9.5402193 | 0.5126917 | 0.7830255 |
| PU → AVI | 0.6645794 | 0.0618023 | 10.8122354 | 0.5429231 | 0.7872081 |
| PSR → AVI | −0.5641075 | 0.0571241 | −9.8810212 | −0.4544264 | −0.6781363 |
| PPC → AVI | −0.3268894 | 0.0626997 | −5.2169322 | −0.2184188 | −0.465145 |
|
| |||||
| ANT → PEU → AVI | 0.4198978 | 0.1867468 | 2.2484875 | 0.8132560 | 0.0733127 |
| ANT → PU → AVI | 0.2459281 | 0.0976730 | 2.5178720 | 0.4426606 | 0.0566385 |
| ANT → PSR → AVI | 0.4339686 | 0.2187507 | 1.9838498 | 0.8881447 | 0.0338555 |
| ANT → PPC → AVI | −0.1672892 | 0.1580518 | −1.058445 | −0.5145335 | 0.1149420 |
| MBA → PEU → AVI | −0.1708457 | 0.2037402 | −0.8385465 | −0.5888469 | 0.2095889 |
| MBA → PU → AVI | −0.2095392 | 0.2091822 | −1.0017067 | −0.6322932 | 0.1991155 |
| MBA → PSR → AVI | 0.0189017 | 0.2052428 | 0.0920943 | −0.3873673 | 0.4192312 |
| MBA → PPC → AVI | 0.0929792 | 0.1570425 | 0.5920644 | −0.4200953 | 0.1982651 |
p < 0.05,
p < 0.01,
p < 0.001.
Validation of the hypotheses.
|
|
|
|---|---|
| H1: Anthropomorphic characteristics are likely to increase users' perceived ease of use, which in turn positively influences the adoption of voice intelligence in the customer value chain. | Supported |
| H2: Anthropomorphic characteristics are likely to increase users' perceived usefulness, which in turn positively influences the adoption of voice intelligence in the customer value chain. | Supported |
| H3: Anthropomorphic characteristics are likely to decrease users' perceived security risks, which in turn positively influences the adoption of voice intelligence in the customer value chain. | Supported |
| H4: Anthropomorphic characteristics are likely to decrease users' perceived privacy concerns, which in turn positively influences the adoption of voice intelligence in the customer value chain. | Partially supported |
| H5: Multimodal biometric authentication is likely to increase users' perceived ease of use, which in turn positively influences the adoption of voice intelligence in the customer value chain. | Partially supported |
| H6: Multimodal biometric authentication is likely to increase users' perceived usefulness, which in turn positively influences the adoption of voice intelligence in the customer value chain. | Partially supported |
| H7: Multimodal biometric authentication is likely to decrease users' perceived security risks, which in turn positively influences the adoption of voice intelligence in the customer value chain. | Partially Supported |
| H8: Multimodal biometric authentication is likely to decrease users' perceived privacy concerns, which in turn positively influences the adoption of voice intelligence in the customer value chain. | Partially supported |
Factorial ANOVA analysis of effects of BA and ANT on users' overall perception.
|
|
|
|
|
| |
|---|---|---|---|---|---|
| ANT | 1 | 9.204 | 11.7173 | 0.05 | [0.01; 0.10] |
| MBA | 1 | 22.204 | 28.2667 | 0.11 | [0.05; 0.17] |
| ANT | MBA | 1 | 5.104 | 6.49788 | 0.03 | [0.00; 0.07] |
| Residuals | 236 | 0.786 |
p < 0.05,
,
p < 0.001.
Regression for demographic and control variables.
|
|
|
|
|---|---|---|
| Users' perceptions | ||
| PEU | 0.56702 | 0.06132 |
| PU | 0.46199 | 0.06630 |
| PSR | −0.30658 | 0.05476 |
| PPC | −0.33341 | 0.03965 |
| Gender | ||
| Male | 0.28640 | 0.59893 |
| Female | 0.10956 | 0.59927 |
| Non-binary | - | |
| Age–groups | ||
| 18–24 | 0.23545 | 0.61486 |
| 25–34 | 0.17946 | 0.18464 |
| 35–44 | 0.41781 | 0.19317 |
| 45–54 | 0.26196 | 0.22145 |
| 55–64 | 0.46221 | 0.28414 |
| Educational background | ||
| High-School graduate | 0.48594 | 0.24902 |
| Attended college | 0.48660 | 0.46167 |
| Bachelor's degree | 0.50146 | 0.18408 |
| Master's degree | 0.42443 | 0.19303 |
| Continent | ||
| Africa | 0.56992 | 0.42655 |
| Asia | 0.02466 | 0.11804 |
| Europe | 0.12591 | 0.29959 |
| North America | 0.1717 | 0.42655 |
| South America | 0.18115 | 0.26708 |
| R sq adjust. (0.6372) | 0.6491 ( | |
| R sq adjust. change | 0.0181 | |
| F statistic | 24.27 | |
| No. of observation | 240 |
p < 0.05,
p < 0.01,
p < 0.001.
The italic value shows the effect in the adjusted R squared after changing the included variables during the analysis.
Figure 2Interaction effects. (A) Examining the effect of anthropomorphism and multimodal biometric authentication on the customer's willingness to adopt voice technology in their value chain. (B) Examining the effect of anthropomorphism and multimodal biometric authentication on the overall customer perception of voice technology in general.