| Literature DB >> 35445745 |
Anna Plessas1, Josafath I Espinosa-Ramos1, Dave Parry1, Sarah Cowie2, Jason Landon1.
Abstract
An accumulated body of choice research has demonstrated that choice behavior can be understood within the context of its history of reinforcement by measuring response patterns. Traditionally, work on predicting choice behaviors has been based on the relationship between the history of reinforcement-the reinforcer arrangement used in training conditions-and choice behavior. We suggest an alternative method that treats the reinforcement history as unknown and focuses only on operant choices to accurately predict (more precisely, retrodict) reinforcement histories. We trained machine learning models known as artificial spiking neural networks (SNNs) on previously published pigeon datasets to detect patterns in choices with specific reinforcement histories-seven arranged concurrent variable-interval schedules in effect for nine reinforcers. Notably, SNN extracted information from a small 'window' of observational data to predict reinforcer arrangements. The models' generalization ability was then tested with new choices of the same pigeons to predict the type of schedule used in training. We examined whether the amount of the data provided affected the prediction accuracy and our results demonstrated that choices made by the pigeons immediately after the delivery of reinforcers provided sufficient information for the model to determine the reinforcement history. These results support the idea that SNNs can process small sets of behavioral data for pattern detection, when the reinforcement history is unknown. This novel approach can influence our decisions to determine appropriate interventions; it can be a valuable addition to our toolbox, for both therapy design and research.Entities:
Keywords: artificial intelligence; choice research; machine learning prediction; reinforcement history; spiking neural networks
Mesh:
Year: 2022 PMID: 35445745 PMCID: PMC9320819 DOI: 10.1002/jeab.759
Source DB: PubMed Journal: J Exp Anal Behav ISSN: 0022-5002 Impact factor: 2.215
Figure 1Left: the Structure of a Biological Neuron. Right: a Sample of a Brain‐Inspired Computational Neuron (Chen et al., 2018)
The Relative Reinforcer Probability (Shown as Probability of Reinforcement to the Left Alternative) for Each of the Seven Concurrent VI Schedules (Referred to as Components) for Both Conditions
| Component | Reinforcer Ratio (Left: Right) |
|---|---|
| 1 | 1:27 |
| 2 | 1:9 |
| 3 | 1:3 |
| 4 | 1:1 |
| 5 | 3:1 |
| 6 | 9:1 |
| 7 | 27:1 |
Note. The overall probability of reinforcement per second was constant at 0.037.
Samples Extracted by the Pigeons' Temporal Data
| Experimental Condition | Datasets | No of Pigeons | Points of 5‐sec periods |
|---|---|---|---|
| Cond. 1 | Training the model | 5 Pigeons | 90 periods |
| Testing model | 1 Pigeon | 90 periods | |
| Cond. 6 | Generalization‐1 | 6 Pigeons | 90 periods |
| Generalization‐2 | 6 Pigeons | 63 periods | |
| Generalization‐3 | 6 Pigeons | 45 periods |
Note. Some samples consisted of <90 points as some of the 10 daily sessions ended at the prearranged time (45 min), and the pigeon had not consumed all reinforcers.
Figure 2The SNN Architecture Constructed for Classification of Seven Components Based on Pigeons' Choices
Figure 3A Schematic Representation of K‐Fold Cross‐Validation Training When Using Right and Left Responses (Pigeons' Choices)
Performance Metrics Used in this Study to Interpret Results and Formulas Used for Calculations
| Measure | Description | Formula |
|---|---|---|
| Accuracy | The fraction of correctly predicted events in relation to all data |
|
| Recall (or sensitivity) | The proportion of correctly predicted actual events (i.e., a true positive) in reference to the total true events |
|
| Specificity | The proportion of correctly predicted nonevents (i.e., a true negative) in reference to the total nonevents |
|
| Informedness | The probability of an informed decision (or Youden's index) |
|
| Precision | The fraction of correctly predicted actual events in reference to retrieved events |
|
| F1 | Weighted average of precision and sensitivity |
|
The Overall Results of the Six Best Models for all Combinations
| Outcomes | Pigeon 61 | Pigeon 62 | Pigeon 63 | Pigeon 64 | Pigeon 65 | Pigeon 66 |
|---|---|---|---|---|---|---|
| Overall | overall | overall | overall | overall | overall | |
| Accuracy | 0.96 | 0.96 | 0.96 | 0.96 | 0.94 | 0.93 |
| Recall Specificity | 0.87 | 0.87 | 0.87 | 0.86 | 0.81 | 0.81 |
| Informedness | 0.98 | 0.98 | 0.97 | 0.97 | 0.96 | 0.96 |
| Precision | 0.85 | 0.85 | 0.84 | 0.83 | 0.77 | 0.77 |
| F1 | 0.87 | 0.87 | 0.87 | 0.86 | 0.81 | 0.81 |
| 0.87 | 0.87 | 0.87 | 0.86 | 0.81 | 0.81 |
Note. The results reflect cross‐validation (CV) training and generalization testing for validation. The results are listed per pigeon used for testing generalization.
Figure 4A Normalized Confusion Matrix Across all Seven Components with the Horizontal Line Representing the Retrodicted Component and the Vertical Line the Actual Component
The Results of the Best Performing Model for all Combinations when Splitting the Data
| Pigeon 61 | Pigeon 62 | Pigeon 63 | Pigeon 64 | Pigeon 65 | Pigeon 66 | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CV | Test | CV | Test | CV | Test | CV | Test | CV | Test | CV | Test | |
| Accuracy | 0.94 | 0.97 0.91 0.98 | 0.95 | 0.97 0.91 0.98 | 0.94 | 0.97 0.91 0.98 | 0.95 | 0.96 0.89 0.98 | 0.95 | 0.92 0.77 0.95 | 0.96 | 0.91 0.74 0.94 |
| Recall | 0.82 | 0.79 | 0.83 | 0.90 | 0.82 | 0.90 | 0.83 | 0.87 | 0.85 | 0.72 | 0.87 | 0.69 |
| Specificity | 0.96 | 0.91 | 0.97 | 0.91 | 0.96 | 0.91 | 0.97 | 0.88 | 0.97 | 0.77 | 0.97 | 0.74 |
| Informedness | 0.79 | 0.90 | 0.80 | 0.91 | 0.79 | 0.91 | 0.80 | 0.88 | 0.82 | 0.77 | 0.84 | 0.74 |
| Precision | 0.82 | 0.83 | 0.82 | 0.83 | 0.82 | 0.87 | ||||||
| F1 | 0.83 | 0.83 | 0.82 | 0.83 | 0.85 | 0.87 |
Note. The results reflect cross‐validation (CV) training and testing for validation.
Figure 5Log Response Ratios of Choices Emitted During the First 5 s Following Each Successive Response in Each of the Seven Components of Condition 1, of Landon & Davison ( 2001 )
Figure 6Total Number of Pair Responses Emitted for the First 5 s Following Each Successive Response in Each of the Seven Components of Condition 1, from Landon & Davison ( 2001 )
The Overall Results of the Best Model for Each Generalization Test
| Generalization Test | # | Model 1 | Model 2 | Model 3 | Model 4 | Model 5 | Model 6 | |
|---|---|---|---|---|---|---|---|---|
| G1 | 10d | Accuracy | 0.96 | 0.95 | 0.95 | 0.95 | 0.94 | 0.94 |
| Sensitivity | 0.89 | 0.85 | 0.85 | 0.84 | 0.82 | 0.82 | ||
| Specificity | 0.98 | 0.97 | 0.97 | 0.96 | 0.96 | 0.96 | ||
| Informedness | 0.87 | 0.82 | 0.82 | 0.81 | 0.78 | 0.79 | ||
| Precision | 0.89 | 0.85 | 0.85 | 0.84 | 0.82 | 0.82 | ||
| F1 | 0.89 | 0.85 | 0.85 | 0.84 | 0.82 | 0.82 | ||
| G2 | 7d | Accuracy | 0.97 | 0.95 | 0.95 | 0.95 | 0.95 | 0.95 |
| Sensitivity | 0.89 | 0.86 | 0.87 | 0.84 | 0.86 | 0.86 | ||
| Specificity | 0.98 | 0.98 | 0.98 | 0.97 | 0.97 | 0.97 | ||
| Informedness | 0.87 | 0.83 | 0.83 | 0.81 | 0.83 | 0.83 | ||
| Precision | 0.89 | 0.86 | 0.86 | 0.84 | 0.86 | 0.86 | ||
| F1 | 0.89 | 0.86 | 0.86 | 0.84 | 0.86 | 0.86 | ||
| G3 | 5d | Accuracy | 0.93 | 0.93 | 0.93 | 0.93 | 0.93 | 0.93 |
| Sensitivity | 0.82 | 0.79 | 0.79 | 0.79 | 0.80 | 0.79 | ||
| Specificity | 0.96 | 0.96 | 0.96 | 0.96 | 0.96 | 0.96 | ||
| Informedness | 0.76 | 0.75 | 0.76 | 0.75 | 0.75 | 0.75 | ||
| Precision | 0.81 | 0.79 | 0.80 | 0.79 | 0.79 | 0.79 | ||
| F1 | 0.81 | 0.79 | 0.80 | 0.79 | 0.79 | 0.79 |
Note. The results reflect generalization tests per pigeon.