| Literature DB >> 35310584 |
Nadrh Abdullah Alhassan1, Abdulaziz Saad Albarrak1, Surbhi Bhatia1, Parul Agarwal2.
Abstract
With the advent of artificial intelligence and proliferation in the demand for an online dialogue system, the popularity of chatbots is growing on various industrial platforms. Their applications are getting widely noticed with intelligent tools as they are able to mimic human behavior in natural languages. Chatbots have been proven successful for many languages, such as English, Spanish, and French, over the years in varied fields like entertainment, medicine, education, and commerce. However, Arabic chatbots are challenging and are scarce, especially in the maintenance domain. Therefore, this research proposes a novel framework for an Arabic troubleshooting chatbot aiming at diagnosing and solving technical issues. The framework addresses the difficulty of using the Arabic language and the shortage of Arabic chatbot content. This research presents a realistic implementation of creating an Arabic corpus for the chatbot using the developed framework. The corpus is developed by extracting IT problems/solutions from multiple domains and reliable sources. The implementation is carried forward towards solving specific technical solutions from customer support websites taken from different well-known organizations such as Samsung, HP, and Microsoft. The claims are proved by evaluating and conducting experiments on the dataset by comparing with the previous researches done in this field using different metrics. Further, the validations are well presented by the proposed system that outperforms the previously developed different types of chatbots in terms of several parameters such as accuracy, response time, dataset data, and solutions given as per the user input.Entities:
Mesh:
Year: 2022 PMID: 35310584 PMCID: PMC8930221 DOI: 10.1155/2022/1844051
Source DB: PubMed Journal: Comput Intell Neurosci
Comparison of a few of the relevant works pertaining to the Arabic chatbots.
| Ref no. | Motivation | Datasets used | Obtained accuracy | Limitations |
|
| ||||
| [ | This article introduces Nabiha, a new Arabic dialect chatbot dedicated to assisting college students | The datasets collected are 248 inputs/outputs from the KSU IT students' accounts in | The result of this paper shows that 15.38% of the total answer is not accurate, 53.85% is somewhat accurate, and 30.77% is very precise | The dataset should be expanded; we need to address the issue of HTML tags as well as Twitter's text area constraints |
| [ | BOTTA's goal is to create a conversational environment and connect with as many Arab users as possible. She's the first chatbot to speak in an Arabic dialect, which helps her achieve her goal of amusing people who are used to conversing in the language | BOTTA is using AIML and launched it on the PANDORABOT platform | BOTTA's pattern matching will be able to correct 85.1 percent of the spelling errors observed in spontaneous Arabic typing | Using corpus-based machine learning approaches, BOTTA's pattern matching has to be improved. Further development will involve morphological analysis of the input and experiments with lemma-based pattern matching using existing tools for Egyptian Arabic processing |
Figure 1System proposal framework of devolving an Arabic troubleshooting chatbot to diagnose and solve technical issues.
Some of the open-source websites with the common companies for IT troubleshooting.
| Name of company | Name of the website | Does it provide Arabic content? |
|
| ||
| Microsoft |
| Yes |
| HP |
| Yes |
| Samsung |
| Yes |
| Apple |
| Yes |
| Huawei |
| Yes |
Some open-source websites used to gather data for IT troubleshooting.
| Name of the website | Does it provide Arabic content? | Does it need translation? |
|
| ||
|
| Yes | No |
|
| Yes | No |
|
| Yes | No |
Figure 2Final form of data scraped after running the code categorized.
Figure 3Final form of data scraped after running the code categorized.
Presentation of an example of an Arabic dialogue implemented by the chatbot.
| Dialogue in Arabic |
| “المساعد الذكي: آسمي المساعد الذكي,أذكر لي مشكلتك؟ وسأحاول جاهدا لحلها!” |
| “المستخدم: أريد إزالة تحديث في جهازي” |
| Here, the user uses only 2 words that match with chatbot corpus: “إزاله” and “تحديث” and still the chatbot diagnoses the problem by saying “إزالة تحديث مثبت في ويندز” |
| “المساعد الذكي: إزالة تحديث مثبت في ويندز: |
| And retrieve the most relevant answer: |
| “لا نوصي بإزالة أي من التحديثات التي تم تثبيتها ومع ذلك، إذا كان من الضروريّ إزالة أحد التحديثات، فيمكنك القيام بذلك في محفوظات التحديثات” |
| Translation of the dialogue |
| Chatbot: hi, my name is “the smart assistant” tell me your technical problem? And I will try to help you |
| User: I want to remove an update on my device |
| Chatbot: we do not recommend removing any installed updates. However, if it is necessary to remove an update, you can do it in the update history.” |
Comparison of the chatbots (our chatbot and Pandorabot).
| Comparison criteria | Chatbot produces by using our proposal framework | Chatbot produces by using Pandorabot platform |
|
| ||
| Type of dataset | - Use unstructured Arabic data (text/yaml). And there is no need to define each question/answer because our chatbot can search through all corpus within seconds to retrieve the data. | - Use AMIL file required to define each question as <pattern></pattern> and the answer as <template></template> for each dialog. |
| Time of response | The answer was retrieved immediately. | The answer took around 30 seconds to retrieve, and it could be “apologize message.” |
| Dealing with user input | - The user can insert any Arabic input by using the formal and informal Arabic language. | - The user can insert only the exact question already fixed in the AIML file. Except that, the chatbot will ignore the question and retrieve “apologize message.” |
| The experimentation | We insert the same dataset for both bots. And we use the same question format. The question used is “نسيت كلمة مرور جوالي الايفون” and the answer should be | |
| The result | - If the question is in informal Arabic language, the bot retrieves the correct answer. Input: “نسيت كلمة مرور جوالي الايفون” | - If the question is in informal Arabic language, the bot retrieves “apologize message.” |
| Finding | - Our chatbot was capable of having any type of Arabic data without having strict inputs to a specific form. The bot uses AI keyword matching to analyze user input and match it with the most relevant problem/solution. | - Pandorasbot working as simple chatbot was capable of matching a text string and offering an answer only when the exact sentence match is found. |
Figure 4Goal-oriented chatbots evaluation for “المساعد الذكي” chatbot.
General analytics of goal-oriented evaluation part 1.
| Dialogs | Problem solved (yes = 1, no = 0) | Problem solved (first reply = 1/second reply = 0,1/more than 2 replies = 0) | Answer relevant (yes = 1, no = 0) |
|
| |||
| D1 | 1 | 0 | 1 |
| D2 | 1 | 0.1 | 1 |
| D3 | 0 | 0 | 0 |
| D4 | 1 | 1 | 1 |
| D5 | 1 | 1 | 1 |
| D6 | 0 | 0 | 0 |
Figure 5Solved/unsolved cases presented in pie chart.
Figure 6Relevant/nonrelevant chatbot's answer presented in pie chart.
General analytics of goal-oriented evaluation part 2.
| Dialogs | Dialogue length (in sentence/line) | User utterance length (in sentence/line) | Number of times in which the participant paraphrases the question |
|
| |||
| D1 | 16 sentences | 5 sentences | 3 times |
| D2 | 13 sentences | 4 sentences | 1 time |
| D3 | 17 sentences | 8 sentences | 4 times |
| D4 | 11 sentences | 4 sentences | 0 times |
| D5 | 7 sentences | 3 sentences | 0 times |
| D6 | 14 sentences | 7 sentences | 3 times |
| The average | 13.17 | 5.17 | 1.8 |