| Literature DB >> 36160369 |
Nur Siyam1, Sherief Abdallah1.
Abstract
Children with autism spectrum disorder (ASD) usually show little interest in academic activities and may display disruptive behavior when presented with assignments. Research indicates that incorporating motivational variables during interventions results in improvements in behavior and academic performance. However, the impact of such motivational variables varies between children. In this paper, we aim to address the problem of selecting the right motivator for children with ASD using reinforcement learning by adapting to the most influential factors impacting the effectiveness of the contingent motivator used. We model the task of selecting a motivator as a Markov decision process problem. The states, actions and rewards design consider the factors that impact the effectiveness of a motivator based on applied behavior analysis as well as learners' individual preferences. We use a Q-learning algorithm to solve the modeled problem. Our proposed solution is then implemented as a mobile application developed for special education plans coordination. To evaluate the motivator selection feature, we conduct a study involving a group of teachers and therapists and assess how the added feature aids the participants in their decision-making process of selecting a motivator. Preliminary results indicated that the motivator selection feature improved the usability of the mobile app. Analysis of the algorithm performance showed promising results and indicated improvement of the recommendations over time.Entities:
Keywords: Autism; Behavior intervention; Intervention therapy; Markov decision processes; Reinforcement learning; Special education
Year: 2022 PMID: 36160369 PMCID: PMC9483340 DOI: 10.1007/s10209-022-00914-7
Source DB: PubMed Journal: Univers Access Inf Soc ISSN: 1615-5289 Impact factor: 2.629
Fig. 1Behavior intervention strategy
Features representing the state space
| Feature | Description | Number of values | References |
|---|---|---|---|
| Antecedent event (trigger) | Event or activity that immediately preceded a problem behavior (alone, given a direction or demand, transitioned to new activity, denied access to an item) | 4 | Bhuyan et al. [ |
| Time of Day | Time of the day the problem behavior occurred (morning, noon, evening) | 3 | Burns et al. [ |
| Subject | The aim of this feature is to account to the place and person the problem behavior occurred with (academic subjects, therapy sessions, home) | 8 | Burns et al. [ |
| Behavior | The problem behavior that requires intervention, grouped into seven categories (aggression, self-injury, disruption, elopement, stereotypy, tantrums, non-compliance) | 7 | Stevens et al. [ |
| Behavior Function | The reason the behavior is occurring (sensory stimulation, escape, access to attention, access to tangibles) | 4 | Alstot and Alstot [ |
| Last unsuccessful motivator | The ID of the last motivator used that was not successful in motivating the student within an episode, including an option for “none” | 7 | |
| Motivator past usage | The number of times each motivator was used within a week grouped in categories of < 5, 5–10, > 11. This factor is composed of six features according to the number of motivators (actions) available (edibles, sensory, activities, tokens, social, choice) | 36 | Çetin [ |
Motivators categories
| Motivator | Description |
|---|---|
| Edible | Food items, such as fruits, snacks, and juice |
| Sensory | Items or activities that realizes pleasure to the senses of the child, such as listening to music, sitting in a rocking chair, or playing with sand |
| Activity | Activities may include drawing, playing with the computer, or jumping on a trampoline |
| Token | Tangible items that the child values, such as stickers, money, or stars on an honor chart |
| Social | Attention or interaction with another person, such as high-fives, smiles, and praise |
| Choice | Giving the child the chance to choose between two different items or methods, such as asking whether she prefers to use a pencil or crayons to write |
Scale of child’s responsiveness (adapted from Koegel and Egel [36])
| Output | Description | Reward |
|---|---|---|
| Negative | Child continues problem behavior (tantrums, kicking, screaming) or does not comply with instructions and engages in behavior unrelated to the activity (rocking, yawning, tapping) | − 1 |
| Neutral | Complies with instructions but tends to get restless or loses attention | + 2 |
| Positive | Performs task readily. Attends to task quickly, smiles while doing the task, and presents appropriate behavior | + 4 |
| Rejected recommendation | The user rejects the motivator recommendation and does not introduce it to the child | − 0.25 |
| Edible item | The motivator selected was an edible item | − 1 |
| Token item | The motivator selected was a token item | − 0.5 |
Fig. 3“Motivator Selection” feature
Fig. 2Behavior monitoring without the “Motivator Selection” feature
Learners’ demographics
| Category | |
|---|---|
| Female | 1 |
| Male | 11 |
| 6 | 3 |
| 7 | 5 |
| 8 | 2 |
| 9 | 2 |
| Level 2 | 5 |
| Level 3 | 7 |
Teachers’ and therapists’ demographics
| Category | |
|---|---|
| Female | 11 |
| Male | 1 |
| Egyptian | 7 |
| Jordanian | 3 |
| Syrian | 2 |
| Subject teacher | 10 |
| Occupational Therapist | 1 |
| Behavior Specialist | 1 |
Fig. 4New SUS mean score compared to the SUS mean score of the previous iteration
Descriptive summary for “no-algorithm” dataset variables
| Variable | Total | Per | Mean | SD | Min | Max |
|---|---|---|---|---|---|---|
| Users | 12 | Research | ||||
| Students | 12 | Research | ||||
| Days | 30 | Research | ||||
| Entries | 490 | User (therapists and teachers) | 40.833 | 40.653 | 6 | 141 |
Descriptive summary for the “algorithm” dataset variables
| Variable | Total | Per | Mean | SD | Min | Max |
|---|---|---|---|---|---|---|
| Users | 12 | Research | ||||
| Students | 12 | Research | ||||
| Days | 32 | Research | ||||
| Entries | 1231 | User (therapists and teachers) | 102.58 | 98.72 | 8 | 376 |
| Episodes | 598 | Day | 38.469 | 26.39 | 1 | 91 |
| Steps (episode length) | Episode | 2.06 | 2.35 | 1 | 22 | |
| Reward | − 2.00 | 4.00 | ||||
| Reward sum | Episode | 2.68 | 1.478 | − 4.50 | 4.00 |
Fig. 5Percentage of effectiveness of motivator with and without using the motivator selection feature
Cross-tabulation of algorithm versus status
| Status | Total | ||
|---|---|---|---|
| Not motivated | Motivated | ||
| Algorithm | |||
| Without | 267 | 223 | 490 |
| 54.5% | 45.5% | 100.0% | |
| With | 69 | 594 | 663 |
| 10.4% | 89.6% | 100.0% | |
| Total | 336 | 817 | 1153 |
| 29.1% | 70.9% | 100.0% | |
Pearson χ2 = 265.16, p < 0.001. Symmetric measures: contingency coefficient = 0.432, p < 0.001
Cells (0.0%) have expected count less than 5. The minimum expected count is 142.79
Summary of logistic regression analysis for algorithm application predicting students’ motivation
| Variable | SE | Wald | Sig | Exp( | ||
|---|---|---|---|---|---|---|
| Algo(1) | 2.333 | 0.156 | 222.987 | 1 | 0.000 | 10.307 |
| Constant | − 0.180 | 0.091 | 3.940 | 1 | 0.047 | 0.835 |
− 2LL = 1118.132, Nagelkerke R2 = 0.301, χ2 = 273.337, df = 1, p < 0.001
Classification accuracy = 74.7%
Fig. 6Scatterplot of reward over time
Regression analysis summary for sequence predicting reward
| Variable | 95% CI | ||||
|---|---|---|---|---|---|
| (Constant) | 2.386 | [2.125, 2.647] | 17.944 | < 0.001 | |
| Sequence | 0.000367 | [0.000024, 0.000710] | 0.081 | 2.101 | 0.036 |
R2 = 0.007, R2adj. = 0.005, CI = confidence interval for B
Regression analysis summary for sequence predicting reward (< 3)
| Variable | 95% CI | ||||
|---|---|---|---|---|---|
| (Constant) | 0.475 | [0.136, 0.813] | 2.764 | 0.006 | |
| Sequence | 0.000627 | [0.000177, 0.001077] | 0.169 | 2.742 | 0.007 |
R2 = 0.028, R2adj. = 0.025, CI = confidence interval for B
Fig. 7Scatterplot of reward (< 3) over time
Fig. 8Scatterplot of episode length against time
Regression analysis summary for sequence predicting episode length
| Variable | 95% CI | ||||
|---|---|---|---|---|---|
| (Constant) | 2.758 | [2.366, 3.149] | 13.841 | < 0.001 | |
| Sequence | − 0.001021 | [− 0.001524, − 0.000518] | − 0.161 | − 3.990 | < 0.001 |
R2 = 0.026, R2adj. =0 .024, CI = Confidence Interval for B
Regression analysis summary for sequence predicting episode length (> 2)
| Variable | 95% CI | ||||
|---|---|---|---|---|---|
| (Constant) | 6.482 | [5.312, 7.652] | 10.965 | < 0.001 | |
| Sequence | − 0.001903 | [− 0.003596, − 0.000210] | − 0.194 | − 2.225 | 0.028 |
R2 = 0.038, R2adj. = 0.030, CI = confidence interval for B
Fig. 9Scatterplot of episode length (> 2) against time
SUS questionnaire items in English and Arabic