| Literature DB >> 29145413 |
Roby Velez1, Jeff Clune1,2.
Abstract
A long-term goal of AI is to produce agents that can learn a diversity of skills throughout their lifetimes andEntities:
Mesh:
Year: 2017 PMID: 29145413 PMCID: PMC5690421 DOI: 10.1371/journal.pone.0187736
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Diffusion treatments outperform non-diffusion treatments across various metrics.
(A) Across all generations diffusion treatments (PA_D & PCC_D) achieve significantly higher (p < 0.001) fitness than non-diffusion (PA & PCC) treatments. (B) Diffusion treatments maintain consistent fitness over their lifetime after the first two seasons, indicating they remember how to solve a task even after they have not performed that task for an entire season. Non-diffusion treatments do not. (C) Diffusion treatments have significantly higher (p < 0.001) Retained Percentages and Perfect (i.e. know both summer and winter) seasonal associations than non-diffusion. (D) Diffusion treatments posses significantly higher (p < 0.001) testing fitness than non-diffusion treatments. Throughout paper, all statistics are done with the Mann-Whitney U test. Markers below line plots indicate a significant difference (p < 0.001) between PA_D and the other treatments at the corresponding data point. For all bar plots, except when stated, a significance bar labeled with ‘***’ is placed between bars that are significant at the level of p < 0.001. Lastly, the summary value and confidence intervals for all plots in this paper are the median and 75th and 25th percentiles respectively.
Fig 2High-performing networks are differentiated from low-performing networks through the presence of distinct functional modules in Core Functional Networks (CFNs), network features not seen when just examining the original, non-simplified ANNs.
(A) Original ANNs for the networks with the best and worst test fitness. Inset text is the training fitness (trainF), testing fitness (testF), and structural modularity of the original ANN (origM) averaged over all 80 environments in the post-evolution analysis. Superficially there is nothing that distinguishes networks that have the best testing fitness. (B) One example CFN for each of the corresponding ANN from A. Inset text is the structural modularity of the original ANN (origM), training fitness (trainF), testing fitness (testF), CFN fitness (cfnF), and CFN modularity (cfnM) for the environment that produced the CFN. High-performing networks possess sparse CFNs with either two distinct functional modules (red and blue) that form separate paths, or a common functional module (green) that branches off into two distinct functional modules. Low-performing networks possess CFNs that are much more entangled, or do not connect to the decision input bits (input nodes marked with ‘D’) or season outputs. Structural modularity is quantified with the Q-Score metric [40]. 20 additional CFNs for the best and worst individuals are provided in S4, S5, S6 and S7 Figs. For diffusion ANNs the locations of the point sources are indicated by small, purple, filled circles (S1 Fig) and the modulatory nodes for non-diffusion ANNs are indicated by circles with thick white borders. Nodes whose activation variance is below 1.0 × 10−9 are deemed to be bias nodes and are visualized with thin, outgoing connections.
Fig 3Structural modularity scores for Core Functional Networks (CFNs) (i.e. functional modularity) sets diffusion networks apart from non-diffusion treatments.
(A) Structural modularity Q-Scores for Core Functional Networks (CFNs) (i.e. functional modularity). (B,C) Scatter plots of functional modularity versus two quantifiable measures of high performance: Retained Percent and testing fitness.
Fig 4Diffusion treatments change only connections that will become the summer and winter functional modules in those respective seasons, while non-diffusion treatments change either all connections (PA) or only common connections every season (PCC).
Note, weight change is also occurring in the other connections within range of the points sources, but we plot only connections that eventually become functional and encode task information. Bars are not statistically compared to one another.
Fig 5An illustration of (A) standard and (B) diffusion-based neuromodulation.
For both, the activation of node 3 depends on the activations of nodes 0 and 1 and the connecting weights w3,0 and w3,1 (Eq 3). The changes in weights w3,0 and w3,1 rely on the sum of modulatory signals received by node 3 (Eq 4). (A) In standard neuromodulation, the modulatory signal comes from the direct connection from the modulatory node 2 (Eq 5). (B) In the diffusion-based neuromodulation implementation in this paper, the modulatory signal comes from the concentration gradients released by the point sources. In this example, the modulatory signal of node 3 is determined by its distance to the summer point source.
Fig 6The first four steps of the ARK procedure for finding the summer functional network.
Fig 7Combination of functional subnetworks to produce functional modules.
(A) ARK identifies the functional subnetworks for summer and winter, red and blue connections respectively, in an ANN. (B) Non-functional connections are removed. Functional subnetworks are combined to produce the final functional modules for summer (red connections), winter (blue connections), and common (green connections). (C) As a final visualization technique, connections from bias nodes are made thin.