| Literature DB >> 35603227 |
Lin Li1, Kyung Young Lee2, Emmanuel Emokpae3, Sung-Byung Yang1.
Abstract
Thanks to artificial intelligence, chatbots have been applied to many consumer-facing applications, especially to online travel agencies (OTAs). This study aims to identify five quality dimensions of chatbot services and investigate their effect on user confirmation, which in turn leads to use continuance. In addition, the moderating role of technology anxiety in the relationships between chatbot quality dimensions and post-use confirmation is examined. Survey data were gathered from 295 users of Chinese OTAs. Partial Least Square (PLS) was used to analyze measurement and structural models. Understandability, reliability, assurance, and interactivity are positively associated with post-use confirmation and technology anxiety moderates the relationships between four chatbot quality dimensions and confirmation. Confirmation is positively associated with satisfaction, which in turn influences use continuance intention. This study examines how chatbot services in OTAs are considered by users (human-like agents vs. technology-enabled services) by investigating the moderating role of technology anxiety. © Institute of Applied Informatics at University of Leipzig 2020.Entities:
Keywords: Artificial intelligence; Chatbot service quality; Extended post-acceptance model of IS continuance; Human-machine interaction; Online travel agency; Technology anxiety
Year: 2021 PMID: 35603227 PMCID: PMC7817351 DOI: 10.1007/s12525-020-00454-z
Source DB: PubMed Journal: Electron Mark ISSN: 1019-6781
A summary of key literature on chatbot quality dimensions
| Chatbot quality dimension | Research setting | Main findings and implications | Reference |
|---|---|---|---|
| Understandability | An experiment was used to determine whether a chatbot could improve consumer experience or not. | Understandability does little help in improving customer support systems and simple tasks should be handled by chatbots. | Nguyen ( |
| A specific test was designed to evaluate the machines’ ability for the quality of chatbots. | Chatbots conduct intelligent behavior to understand human conversations. | Park et al. ( | |
| ELISA chatbot was evaluated in terms of information quality and service quality. | Chatbots with understandability could provide better solutions and performance. | Sensuse et al. ( | |
| Exploratory wizard-of-oz (WoZ) studies were conducted to simulate interactions with a hypothetical chatbot. | Chatbot agents that can understand and use humans’ humor are ranked more likable, cooperative, and capable than those that cannot. | Thies et al. ( | |
| Reliability | A simple Arabic chatbot was designed to study the chatbot applications and their challenges in chatbot development. | The reliability of chatbots could increase the acceptance rate of chatbots in the Arabic world’s online communities. | AlHagbani and Khan ( |
| A new methodology, Quark was used to develop human-chatbot interactions in a typical management process. | The reliability of chatbots can be ensured if a meaningful response is provided in a conversation. | Kalia et al. ( | |
| ELISA chatbot was evaluated in terms of information quality and service quality. | Chatbots that are reliable could enhance the effectiveness of job performance and motivate further development. | Sensuse et al. ( | |
| Responsiveness | Text-based conversational chatbots were tested in human instant messaging dialogues for the development of a computational model. | Responsiveness could significantly improve chatbot design quality in terms of creating interaction profiles. | Danilava et al. ( |
| A chatbot-focused quality model was developed by extracting extant literature and conducting expert interviews. | Responsiveness is one of the chatbot quality attributes and can be used for ensuring the quality of chatbots. | Meerschman and Verkeyn ( | |
| An experiment was conducted to determine if a chatbot could improve the consumer experience. | Responsiveness could significantly improve customer support systems. | Nguyen ( | |
| Assurance | A survey was used to explore the factors that influence user satisfaction, which leads to the use intention of chatbots in the context of financial service. | Assurance is a salient factor in service quality that influences user satisfaction and use intention of chatbots in financial services. | Lee and Park ( |
| A chatbot, Jennifer, was built to examine if the public information generated from reputable sources are better organized and shared during the COVID-19 outbreak. | Assurance of quality could be secured when the information is provided from reputable sources and information quality should be maintained in a rigorous process in an empathetic manner. | Li et al. ( | |
| Classifications were conducted to determine the key dimensions for the success of chatbots. | Quality assurance is a major dimension for unsophisticated script-based conversational chatbots. | Pereira and Díaz ( | |
| Interactivity | A 2x2x2 experiment was conducted to explore the interactivity with chatbots in choosing a digital camera on an e-Commerce website. | Interactivity is an important factor in increasing the humanness of chatbot-based systems. | Go and Sundar ( |
| An experimental survey was designed to explore the impact of interactivity with conversational agents on stress-related disclosure. | The level of interactions with conversational agents could affect the level of disclosure of sensitive topics. | Sannon et al. ( | |
| A five-condition, between-participants lab experiment on an online movie search website was conducted to find the effects of message interactivity on perceived contingency and user engagement. | The level of message interactivity including interaction history and chatbot function greatly influences individuals’ evaluations of the movie website. | Sundar et al. ( |
Fig. 1Research model
Operational definitions
| Construct | Operational definition | References |
|---|---|---|
| Understandability | The extent to which users perceive that a chatbot service understands human’s dialogues, the context of a conversation, and the nuance of human language | Park et al. ( |
| Reliability | The extent to which users perceive that a chatbot service has the ability to perform the promised service dependably and accurately | Parasuraman et al. ( |
| Responsiveness | The extent to which users perceive that a chatbot service shows a willingness to help users and provides prompt services to users | Parasuraman et al. ( |
| Assurance | The extent to which users perceive that a chatbot service has knowledge and ability to inspire trust and confidence to users | Parasuraman et al. ( |
| Interactivity | The extent to which users perceive that their communication with a chatbot service resembles the dialogues they have with human agents (with multiple times of interactions), so that feel in control of their personal needs when using it | Cho et al. ( |
| Technology anxiety | The extent to which users feel intimidation, unfamiliarity, and difficulty of using a chatbot service | Meuter et al. ( |
| Confirmation | The extent to which users’ initial expectation about the performance of chatbot-based OTAs has been met | Bhattacherjee ( |
| Satisfaction | The extent to which users perceive that their positive emotional state comes from an appraisal of the jobs done by chatbot-based OTAs | Bhattacherjee ( |
| Use continuance | The extent to which users perceive that they intend to continue using the chatbot-based OTAs | Bhattacherjee ( |
Fig. 2A user’s typical exchange of information in a conversation with a chatbot in OTAs
Demographics of respondents
| Demographics | Frequency | Percent |
|---|---|---|
| Gender | ||
| Female | 125 | 42.4% |
| Male | 170 | 57.6% |
| Age | ||
| < 16 | 15 | 5.1% |
| 17–21 | 156 | 52.9% |
| 22–30 | 100 | 33.9% |
| 31–45 | 19 | 6.4% |
| 46–64 | 4 | 1.4% |
| > 65 | 1 | 0.3% |
| Education | ||
| High school | 8 | 2.7% |
| Studying at the undergraduate level | 23 | 7.8% |
| Bachelor’s degree | 217 | 73.6% |
| Studying at the graduate level | 38 | 12.9% |
| Master’s degree and above | 9 | 3.1% |
| Income (Renminbi: RMB, Chinese Yuan) | ||
| < 3000 RMB | 82 | 27.8% |
| 3000–5000 RMB | 112 | 37.9% |
| 5000–7000 RMB | 61 | 20.7% |
| 7000–9000 RMB | 24 | 8.1% |
| 9000–11,000 RMB | 7 | 2.4% |
| > 11,000 RMB | 9 | 3.1% |
| Occupation | ||
| Student | 44 | 14.9% |
| Working | 238 | 80.7% |
| Unemployed | 7 | 2.4% |
| Other | 6 | 2.0% |
| Usage frequency (indicating how often do users use chatbot services) | ||
| Less than once a month | 63 | 21.4% |
| Once to twice a month | 172 | 58.3% |
| Once to twice a week | 34 | 11.5% |
| Three to four times a week | 13 | 4.4% |
| More than five times a week | 2 | 0.7% |
| Other | 11 | 3.7% |
| Average time per use (indicating, on average, how long do users use chatbot services) | ||
| Less than 5 min | 21 | 7.1% |
| 5 to 10 min | 108 | 36.6% |
| 10 to 20 min | 95 | 32.2% |
| 20 to 30 min | 46 | 15.6% |
| 30 min to 1 h | 19 | 6.4% |
| More than 1 h | 3 | 1% |
| Other | 3 | 1% |
| Total | 295 | 100% |
Questionnaire items
| Construct | Items | References |
|---|---|---|
| Use continuance | 1. My intentions are to continue using this chatbot service over other alternative means of communication or searching tools on this OTA. | Bhattacherjee |
| 2. All things considered, I expect to continue using this chatbot service often in the future. | ||
| 3. I can see myself increasing the use of this chatbot service if possible. | ||
| Satisfaction | 1. I like to use the chatbot service from this online travel website. | Fang et al. ( |
| 2.I am pleased with the experience of using this chatbot service. | ||
| 3. I think that using the chatbot service on this travel website is a good idea. | ||
| 4. Overall, I am satisfied with the experience of using this chatbot service. | ||
| Confirmation | 1. My experience with using this chatbot service was better than what I expected. | Hong et al. ( |
| 2. The service level provided by this chatbot service was better than what I expected. | ||
| 3. Overall, most of my expectations regarding the usage of this chatbot service were confirmed. | ||
| Technology anxiety | 1. I have avoided chatbot services because they are unfamiliar to me. | Meuter et al. ( |
| 2. I hesitate to use chatbot services for fear of making mistakes I cannot correct. | ||
| 3. I have difficulty understanding most technological matters relating to chatbot services. | ||
| 4. I am not able to keep up with important technological advances, such as the development of chatbot services. | ||
| Understandability | 1. I feel that what I’m saying to this chatbot system is well understood by the system. | Oulasvirta et al. ( |
| 2. I feel that the words in my questions are well understood by this chatbot service. | ||
| 3. I feel that this chatbot system understands my intentions when I ask a question to it. | ||
| Reliability | 1. This chatbot service is dependable. | Parasuraman et al. ( |
| 2. When I have problems, this chatbot service is sympathetic and reassuring. | ||
| 3. I feel I could rely on this chatbot for its services regarding my travel needs. | ||
| Responsiveness | 1. This chatbot provides prompt services that meet my expectations. | Parasuraman et al. ( |
| 2. This chatbot service responds to my requests promptly. | ||
| 3. This chatbot provides services exactly when I need them without any delay. | ||
| Assurance | 1. I trust this chatbot service. | Parasuraman et al. ( |
| 2. I feel safe and assured to have a conversation with this chatbot service. | ||
| 3. The chatbot service of this travel website has enough knowledge to answer my travel questions. | ||
| Interactivity | 1. I can be in control of my personal needs through this chatbot service. | Cho et al. ( |
| 2. I perceive this chatbot service to be sensitive to my personal needs. | ||
| 3. This chatbot service provides an opportunity for me to give my responses. |
Reliability and convergent validity
| Construct | Factor loading | Cronbach’s | Composite reliability (CR) | Average variance extracted (AVE) |
|---|---|---|---|---|
| Understandability (UN) | 0.866 | 0.785 | 0.875 | 0.700 |
| 0.862 | ||||
| 0.791 | ||||
| Reliability (REL) | 0.821 | 0.780 | 0.872 | 0.695 |
| 0.843 | ||||
| 0.836 | ||||
| Responsiveness (RES) | 0.804 | 0.674 | 0.815 | 0.595 |
| 0.710 | ||||
| 0.797 | ||||
| Assurance (ASS) | 0.804 | 0.743 | 0.854 | 0.660 |
| 0.828 | ||||
| 0.806 | ||||
| Interactivity (INT) | 0.818 | 0.825 | 0.884 | 0.657 |
| 0.824 | ||||
| 0.810 | ||||
| Confirmation (CF) | 0.821 | 0.773 | 0.868 | 0.688 |
| 0.869 | ||||
| 0.796 | ||||
| Satisfaction (SAT) | 0.858 | 0.812 | 0.877 | 0.642 |
| 0.746 | ||||
| 0.837 | ||||
| 0.758 | ||||
| Use continuance (UC) | 0.815 | 0.825 | 0.884 | 0.657 |
| 0.786 | ||||
| 0.852 | ||||
| 0.786 | ||||
| Technology anxiety (TA) | 0.720 | 0.670 | 0.786 | 0.483 |
| 0.800 | ||||
| 0.710 | ||||
| 0.521 |
Construct correlations and discriminant validity
| Construct | (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | (9) |
|---|---|---|---|---|---|---|---|---|---|
| UN (1) | |||||||||
| REL (2) | 0.754 | ||||||||
| RES (3) | 0.674 | 0.678 | |||||||
| ASS (4) | 0.736 | 0.712 | 0.666 | ||||||
| INT (5) | 0.690 | 0.752 | 0.669 | 0.724 | |||||
| CF (6) | 0.615 | 0.649 | 0.557 | 0.612 | 0.615 | ||||
| SAT (7) | 0.678 | 0.689 | 0.585 | 0.637 | 0.659 | 0.757 | |||
| UC (8) | 0.608 | 0.650 | 0.552 | 0.586 | 0.594 | 0.637 | 0.746 | ||
| TA (9) | −0.440 | −0.430 | −0.528 | −0.392 | −0.375 | −0.291 | −0.321 | −0.327 |
Results of common method bias test
| Construct | Indicator | Substantive factor loading ( | Common method factor loading ( | ||
|---|---|---|---|---|---|
| UN | UN1 | 0.849 | 0.7208 | 0.008 | 0.0001 |
| UN2 | 0.968 | 0.9370 | −0.115 | 0.0132 | |
| UN3 | 0.684 | 0.4679 | 0.115 | 0.0132 | |
| REL | REL1 | 0.796 | 0.6336 | 0.033 | 0.0011 |
| REL2 | 0.763 | 0.5822 | 0.089 | 0.0079 | |
| REL3 | 0.943 | 0.8892 | −0.125 | 0.0156 | |
| RES | RES1 | 0.837 | 0.7006 | −0.001 | 0.0000 |
| RES2 | 0.972 | 0.9448 | −0.374 | 0.1399 | |
| RES3 | 0.439 | 0.1927 | 0.354 | 0.1253 | |
| ASS | ASS1 | 0.632 | 0.3994 | 0.188 | 0.0353 |
| ASS2 | 0.818 | 0.6691 | 0.018 | 0.0003 | |
| ASS3 | 0.992 | 0.9841 | −0.210 | 0.0441 | |
| INT | INT1 | 0.840 | 0.7056 | −0.042 | 0.0018 |
| INT2 | 0.824 | 0.6790 | 0.004 | 0.0000 | |
| INT3 | 0.790 | 0.6241 | 0.037 | 0.0014 | |
| CF | CF1 | 0.910 | 0.8281 | −0.104 | 0.0108 |
| CF2 | 0.797 | 0.6352 | 0.081 | 0.0066 | |
| CF3 | 0.784 | 0.6147 | 0.019 | 0.0004 | |
| SAT | SAT1 | 0.875 | 0.7656 | −0.014 | 0.0002 |
| SAT2 | 0.641 | 0.4109 | 0.116 | 0.0135 | |
| SAT3 | 0.909 | 0.8263 | −0.078 | 0.0061 | |
| SAT4 | 0.765 | 0.5852 | −0.014 | 0.0002 | |
| UC | UC1 | 0.773 | 0.5975 | 0.051 | 0.0026 |
| UC2 | 0.641 | 0.4109 | −0.117 | 0.0137 | |
| UC3 | 0.909 | 0.8263 | −0.089 | 0.0079 | |
| UC4 | 0.765 | 0.5852 | 0.160 | 0.0256 | |
| TA | TA1 | 0.644 | 0.4147 | −0.075 | 0.0056 |
| TA2 | 0.691 | 0.4775 | −0.087 | 0.0076 | |
| TA3 | 0.743 | 0.5520 | 0.014 | 0.0002 | |
| TA4 | 0.764 | 0.5837 | 0.165 | 0.0272 | |
| Average | 0.7919 | 0.6415 | 0.0002 | 0.0176 |
Fig. 3Results of the structural model
The summary of the hypotheses testing
| Hypothesis | Path coefficient ( | Result | |
|---|---|---|---|
| 0.145† | 1.744 | Marginally Supported | |
| 0.261** | 2.732 | Supported | |
| 0.072n.s. | 1.173 | Not Supported | |
| 0.159* | 1.972 | Supported | |
| 0.156* | 2.025 | Supported | |
| 0.154* | 2.08 | Supported | |
| 0.149* | 1.978 | Supported | |
| 0.091n.s. | 0.737 | Not Supported | |
| 0.130† | 1.789 | Marginally Supported | |
| 0.196* | 2.401 | Supported | |
| 0.757*** | 24.864 | Supported | |
| 0.760*** | 22.469 | Supported |
†p < 0.1; *p < 0.05; **p < 0.01; ***p < 0.001 (two-tailed); n.s. = not significant