| Literature DB >> 25954306 |
Gabriel Recchia1, Magnus Sahlgren2, Pentti Kanerva3, Michael N Jones4.
Abstract
Circular convolution and random permutation have each been proposed as neurally plausible binding operators capable of encoding sequential information in semantic memory. We perform several controlled comparisons of circular convolution and random permutation as means of encoding paired associates as well as encoding sequential information. Random permutations outperformed convolution with respect to the number of paired associates that can be reliably stored in a single memory trace. Performance was equal on semantic tasks when using a small corpus, but random permutations were ultimately capable of achieving superior performance due to their higher scalability to large corpora. Finally, "noisy" permutations in which units are mapped to other units arbitrarily (no one-to-one mapping) perform nearly as well as true permutations. These findings increase the neurological plausibility of random permutations and highlight their utility in vector space models of semantics.Entities:
Mesh:
Year: 2015 PMID: 25954306 PMCID: PMC4405220 DOI: 10.1155/2015/986574
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1Retrieval accuracies for convolution-based associative memories with Gaussian vectors.
Figure 2Retrieval accuracies for RP-based associative memories with Gaussian vectors.
Figure 3Retrieval accuracies for RP-based associative memories with sparse vectors. The first number reported in the legend (6 or 12) refers to the number of pairs stored in a single memory vector, while the other (256 or 512) refers to the vector dimensionality.
Figure 4Retrieval accuracies for RP-based associative memories with sparse vectors.
Comparisons of variants of BEAGLE differing by binding operation.
| Task | Wikipedia subset | Full Wikipedia | |
|---|---|---|---|
| Convolution | Random permutation | Random permutation | |
| ESL | 0.20 | 0.26 | 0.32 |
| TOEFL | 0.46† | 0.46† | 0.63† |
| RG | 0.07 | −0.06 | 0.32∗ |
| MC | 0.08 | −0.01 | 0.33∗ |
| R | 0.06 | −0.04 | 0.35∗ |
| F | 0.13∗ | 0.12∗ | 0.33∗ |
∗Significant correlation, P < 0.05, one-tailed.
†Accuracy score differs significantly from chance, P < 0.05, one-tailed.
Note. For synonymy tests (ESL, TOEFL), values represent the percentage of correct responses.
For all other tasks, values represent Spearman rank correlations between human judgments of semantic similarity and those of the model. Abbreviations for tasks are defined in the main text of the paper.
Comparison of BEAGLE and RPM by corpus.
| Task | Wikipedia subset | TASA | Full Wikipedia | ||
|---|---|---|---|---|---|
| BEAGLE | RPM | BEAGLE | RPM | RPM | |
| ESL | 0.24 | 0.27 | 0.30 | 0.36† | 0.50† |
| TOEFL | 0.47† | 0.40† | 0.54† | 0.77† | 0.66† |
| RG | 0.10 | 0.10 | 0.21 | 0.53∗ | 0.65∗ |
| MC | 0.09 | 0.12 | 0.29 | 0.52∗ | 0.61∗ |
| R | 0.09 | 0.03 | 0.30 | 0.56∗ | 0.56∗ |
| F | 0.23∗ | 0.19∗ | 0.27∗ | 0.33∗ | 0.39∗ |
∗Significant correlation, P < 0.05, one-tailed.
†Accuracy score differs significantly from chance, P < 0.05, one-tailed.
Note. For synonymy tests (ESL, TOEFL), values represent the percentage of correct responses.
For all other tasks, values represent Spearman rank correlations between human judgments of semantic similarity and those of the model. Abbreviations for tasks are defined in the main text of the paper.
Figure 5(a) Visual representation of a random permutation function, instantiated by a one-layer recurrent network that maps each node to a unique node on the same layer via copy connections. The network at left would transform an input pattern of 〈0.1,0.2,0.3,0.4,0.5〉 to 〈0.5,0.1,0.4,0.2,0.3〉. (b) A one-layer recurrent network in which each node is mapped to a random node on the same layer, but which lacks the uniqueness constraint of a random permutation function. Multiple inputs feeding into the same node are summed. Thus, the network at right would transform an input pattern of 〈0.1,0.2,0.3,0.4,0.5〉 to 〈0.4,0.1,0.8,0, 0.2〉. At high dimensions, replacing the random permutation function in the vector space model of Sahlgren et al. [17] with an arbitrarily connected network such as this has minimal impact on fits to human semantic similarity judgments (Experiment 4).
Comparison of RPM using random connections (RC) versus random permutations (RP).
| Task | Full Wikipedia | TASA | ||||
|---|---|---|---|---|---|---|
| RC Sim 1 | RC Sim 2 | RP | RC Sim 1 | RC Sim 2 | RP | |
| ESL | 0.50† | 0.48† | 0.50† | 0.32 | 0.32 | 0.36† |
| TOEFL | 0.66† | 0.66† | 0.66† | 0.73† | 0.74† | 0.77† |
| RG | 0.66∗ | 0.64∗ | 0.65∗ | 0.53∗ | 0.52∗ | 0.53∗ |
| MC | 0.63∗ | 0.61∗ | 0.61∗ | 0.53∗ | 0.51∗ | 0.52∗ |
| R | 0.58∗ | 0.56∗ | 0.56∗ | 0.55∗ | 0.54∗ | 0.56∗ |
| F | 0.39∗ | 0.37∗ | 0.39∗ | 0.32∗ | 0.33∗ | 0.33∗ |
∗Significant correlation, P < 0.05, one-tailed.
†Accuracy score differs significantly from chance, P < 0.05, one-tailed.
Note. For synonymy tests (ESL, TOEFL), values represent the percentage of correct responses.
For all other tasks, values represent Spearman rank correlations between human judgments of semantic similarity and those of the model. Abbreviations for tasks are defined in the main text of the paper.