Literature DB >> 32251443

Exploring individual variation in associative learning abilities through an operant conditioning task in wild baboons.

Claudia Martina^1,2, Guy Cowlishaw², Alecia J Carter^1,2,3.

Abstract

Cognitive abilities underpin many of the behavioural decisions of animals. However, we still have very little understanding of how and why cognitive abilities vary between individuals of the same species in wild populations. In this study, we assessed the associative learning abilities of wild chacma baboons (Papio ursinus) across two troops in Namibia with a simple operant conditioning task. We evaluated the ability of individuals to correctly associate a particular colour of corn kernels with a distasteful flavour through repeated presentations of two small piles of corn dyed different colours, one of which had been treated with a non-toxic bitter substance. We also assessed whether individual variation in learning ability was associated with particular phenotypic traits (sex, social rank and neophilia) and states (age and prior vigilance). We found no evidence of learning the association either within each trial or across trials, nor any variation based on individuals' phenotypes. This appeared to be due to a high tolerance for bitter foods leading to similar acceptance of both palatable and unpalatable kernels. Earlier avoidance of the bitter kernels during pilot trials suggests this higher tolerance may have been largely driven by a drought during the experiments. Overall, our findings highlight the potential influence of current environmental challenges associated with conducting cognitive tests of animals in the wild.

Entities: CellLine Chemical Disease Species

Year: 2020 PMID： 32251443 PMCID： PMC7135308 DOI： 10.1371/journal.pone.0230810

Source DB: PubMed Journal: PLoS One ISSN： 1932-6203 Impact factor: 3.240

Introduction

Learning results from past experiences which allow animals to adjust their behaviour accordingly [1]. Associative learning—a cognitive process that involves an association between stimuli and reinforcements—is key to many facets of animal behaviour [2], including fitness-related aspects such as foraging behaviour [cue preferences: 3,4; spatial memory: 5] and reproductive success [mate appeasement: 6; mate availability: 7; mating success: 8]. The costs and benefits associated to cues (i.e. highly negative or rewarding outcomes), will ultimately affect fitness, as they will determine the speed and strength with which novel associations are made [9,10]. While ultimately, differences in associative learning abilities between species are likely to reflect adaptations [e.g. 11–13; but see: 14,15], inter-specific differences are also likely to reflect genotype [16,17] or epigenetic changes dependent on developmental trajectory and the environment experienced during their lifetimes [18,19]. One of the most commonly studied types of associative learning is operant conditioning, where learning is reinforced by the individual’s own behaviour [20]. In his original work on operant learning, Skinner [21] trained pigeons and mice to press a lever to obtain a food reward until the animals pressed the lever continuously even in the absence of food. However, research on operant conditioning, as in other areas of animal cognition, has commonly relied on devices and protocols that restrict studies to captive conditions. While captive conditions offer a controlled environment in which to test animals [22,23], they may also limit the generalisability of the findings to cognition in the wild, for three reasons. First, captive research relies heavily on the training of animals for long periods of time [24-26], e.g., Okanoya et al. [26] demonstrated that captive caviomorph rodent degus (Octodon degus) are capable of tool use, but only after a training period of 2,500 trials. As such, conclusions obtained from captive studies may not be entirely representative of natural processes, particularly because opportunities to develop a given ability may be quite different for captive and wild animals. Second, captive animals often develop different behaviours to their wild conspecifics as a result of acquired experience through exposure to human-made objects or enforced proximity with conspecifics [27], e.g., captive animals often demonstrate greater diversity in tool use [27,28]. Conversely, the failure of captive animals to solve tasks that they would never encounter in the wild may equally distort our estimate of that species’ cognitive abilities. This makes it difficult to predict how well the performance of captive subjects represents the abilities of wild conspecifics. Lastly, findings in captivity are likely to be influenced by the highly controlled conditions experienced by the study subjects [29], where the environmental and social aspects of behaviours that occur in natural circumstances, and that influence overall cognitive processing, are often ignored [30]. In this study, we investigated individual variation in the associative learning abilities of wild chacma baboons (Papio ursinus) by presenting individuals with an operant conditioning task that required them to associate colour with taste. Baboons, like humans, have trichromatic colour vision [i.e. they discriminate hues along the visible colour spectrum: 31], a trait predicted to have evolved out of the need to find ripe fruit amongst foliage [32]. The task presented here reflects a biologically relevant design as baboons may use colour changes in plant foods to help assess palatability [e.g., as fruits ripen: 33] and builds on previous studies of animal learning abilities that use colour cues during foraging [e.g. common bumblebees, Bombus terrestris: 3; pipevine swallowtails, Battus philenor: 34; greenfinches, Carduelis chloris: 35]. We tested for evidence of learning, with our null hypothesis being that the baboons would not learn the association between the colour (red or green) and palatability (palatable or bitter) of two food choices across five presentations (trials). We tested three possible mutually-exclusive processes about how the baboons could learn the association: we hypothesised that individuals would either rapidly learn the association between the colour and taste of two food choices during the first trial, after which individuals would choose only the food associated with the palatable colour in subsequent trials (learning process 1); or re-learn the association in each trial as independent events (failing to remember the association between trials), sampling both colours in each trial before selecting the palatable food (learning process 2); or gradually learn the association across trials, improving after each trial until they either largely/completely avoided the distasteful food, or preferred to consume the palatable option before the unpalatable one (learning process 3). In addition, for each of these three possible learning processes, we tested five hypotheses regarding the source of individual variation in the learnt association according to five different phenotypic traits/states, as described below.

Individual variation in associative learning ability: Hypotheses and tests

We tested three phenotypic traits (sex, social rank, neophilia) and two states (age, prior vigilance) that might explain individual differences in learning ability:

Sex

Sex differences have been recorded in a variety of cognitive abilities, including spatial cognition [36,37], innovation [38], and learning [39]. Furthermore, cognitive ability may be under sexual selection. Females may benefit from choosing males who, as a result of enhanced cognitive skills, are able to acquire more resources [40]; e.g., a male’s success in a novel foraging task can correlate with his song complexity, a sexually selected trait females use to choose mating partners [zebra finches, Taeniopygia guttata: 41]. In this study, we predicted that female baboons would be more successful in a given task because, during gestation and lactation, females need to increase nutrient consumption while miminising exposure to plant secondary compounds [42,43], potentially selecting for more discriminative associative learning abilities where foods are concerned.

Age

Success or failure in cognitive tasks is often attributed to individual age [8,44]. For this task, we predicted that adult baboons would outperform juveniles. Although juvenile baboons have higher levels of exploratory behaviour than their older conspecifics [45,46], we might expect any associated advantage in cognitive testing to disappear once this has been accounted for (by controlling for individual neophilia, see below), such that adults outperform juveniles because of their greater experience identifying changes in food items with regards to their colour and taste.

Social rank

Cognitive performance may vary depending on the social rank of individuals [38,47,48]. Based on previous findings, we predicted two possible outcomes. On the one hand, dominants might outperform subordinates, for two reasons: because (i) they could have greater access to key resources (such as more nutritious foods in early life) that allow for better development and maintenance of cognitive abilities; and/or (ii) they are unlikely to be displaced and consequently can afford more time to solve a cognitive challenge [49,50]. On the other hand, subordinates might outperform dominants because low social status has promoted the development of cognitive abilities to circumvent traditional competition with dominants [51,52], for example, subordinate baboons are known to run ahead to access resources before the dominant arrives [53].

Neophilia

Individuals may differ in their reaction to novel situations: some animals avoid novelty, while others are attracted to it. Although both responses are sometimes considered opposite ends of the same continuum, they are independent of one another [54]. Neophobia, an aversion to novel stimuli [45], usually impedes cognitive performance [55] such that neophobic animals are less likely to fully engage with novel situations [48], whereas neophilia, an attraction towards novel stimuli [45,54], is associated with greater innovation and successful problem-solving as individuals are more likely to explore a novel situation [56,57]. Based on these previous findings, we predicted that more neophilic baboons would outperform less neophilic conspecifics.

Vigilance

Vigilance is defined as a state of frequent alertness that increases the likelihood of identifying possible changes or danger in individuals’ immediate environment [58]. Vigilance often requires individuals to devote time and attention away from their current activity. When engaged in a novel task, this may compromise learning and reduce future task performance [40]. We predicted that baboons who were more vigilant during earlier experiences with a task, for instance because they perceive a higher risk of attack from conspecifics and/or predators [e.g. social monitoring: 59; predator detection: 60], would perform less well the next time they attempted the task.

Materials and methods

Fieldwork was carried out over a 6-month field season (April-September 2015) on two fully-habituated troops of chacma baboons, ranging in size from 43 (L troop) to 44 (J troop) individuals over four years of age (S1 Appendix) at Tsaobis Nature Park (15° 45’E, 22° 23’S) on the edge of the Namib Desert, Namibia. All these individuals were individually identifiable. Observers accompanied both troops on foot from dawn to dusk and used Cybertracker software (www.cybertracker.org) on smart phones (Samsung Galaxy S4) to record dominance and social interactions ad libitum. Our five individual traits/states were measured as follows: sex was assigned by physical appearance (baboons are sexually dimorphic and have external genitalia); age was scored as juvenile or adult, where adulthood was defined by menarche in females and canine development in males. Generally, females become adults at 4–5 years of age while males become adult at approx. 9 years of age [61]; social rank was determined using a dyadic winner-loser matrix in which all displacements, supplants, attacks, chases and threats observed between two individuals were recorded. Dominance hierarchies were calculated from the matrix using Matman software (Noldus Information Technology). In both troops, the hierarchy was strongly linear (Landau’s corrected linearity index: h’J = 0.34, h’L = 0.41, nJ = 946, nL = 861, p <0.001 in both cases). Individuals’ ranks were subsequently expressed relatively to control for differences in group size, ranging from 0 (lowest rank) to 1 (highest rank), using the formula 1-[(1-r)/(1-n)], where r is the absolute rank of each individual and n is the total group size [62]; neophilia was measured using an established experimental approach in which individuals’ responses to a novel food (an eighth of an apple dyed blue) were assessed using the time spent inspecting the item (s, median = 4; range = 1–120). All trials were conducted by AJC and have been shown to be repeatable in this population [r = 0.26, P = 0.02: 58]. For further details, see: [63]; and, vigilance was defined as the number of times an individual lifted its head to scan its surroundings during the experimental trials and expressed as a rate per trial (the number of vigilance events observed divided by the total time of the trial (s)). We evaluated whether vigilance behaviour in the preceding trial affected individuals’ responses in the current trial (e.g., the vigilance rate of trial 1 was used in trial 2).

Experimental procedure

Individuals’ associative learning abilities were evaluated with a task in which an association between the colour (red/green) and palatability (bitter/normal) of two piles of corn kernels had to be learned. Across Southern Africa, chacma baboons are notorious crop raiders of maize fields [64], which is a highly desirable and nutritious food. A simple pilot study was conducted by CM prior to this experiment, in Dec 2014 –Jan 2015, to observe individuals’ responses to corn kernels soaked in a non-toxic concentrated bitter solution containing denatonium benzoate (‘Avert’, Kyron Laboratories Pty Ltd). During this pilot, two piles, one palatable and one unpalatable, of approx. 20 uncoloured kernels each, were presented on a single occasion to a random sample of 25 individuals (14 females: 13 adults, 1 juvenile); 11 males: 6 adults, 5 juveniles). Individuals were deemed to be responsive to the bitter solution if they left kernels uneaten after tasting the bitter kernels. Baboons left kernels on 16 out of 25 occasions (64%) and spent a median of 12 s (range: 0–64 s) exploring the kernels (i.e. sniffing, rubbing kernels against forearm fur and/or repeatedly biting and spitting out the kernels). Such exploratory behaviour occurred in 21 out of 25 presentations (84%). In the main experiment, a representative subset of 38 individuals was tested across the two troops. This involved 14 adult females, 3 juvenile females, 6 adult males, and 14 juvenile males, in each case, comprising 38–66% of the identifiable individuals in that age-sex class in our study population (S2 and S3 Appendices provide a breakdown of the number of baboons tested according to their sex-age class and their dominance and neophilia scores respectively). To keep a balanced sample it was necessary to include some individuals from our pilot study (n = 14); however, we considered it unlikely that these individuals’ responses to the task would be any different, as all the baboons had prior experience with corn kernels [see: 65] and none had previously participated in a task involving colour cues indicating variation in palatability. Individuals were presented with two equal amounts of dried maize kernels (approx. 20 kernels each) of different colour and palatability, and their speed of learning this association was assessed over 3–5 presentations (median: 5 presentations). Corn kernels were initially soaked overnight in either a red or green edible food colourant (Moir’s Food Dye); on the following night, one of these colours was soaked again in the bitter solution. For a similar methodology involving vervet monkeys, Chlorocebus aethiops, see [66]. Each troop was presented with a different unpalatable colour (green in J troop, red in L troop). All trials were conducted by CM and an assistant. To avoid test subjects being displaced by dominant animals, or an audience learning socially by observing others, presentations were made to individuals when out of sight of conspecifics. CM and her assistant moved ahead of the foraging individual and waited until it was out of sight of others, at which point the assistant, who was positioned to record the trial a few meters ahead, indicated that the trial could start. CM then placed the two piles of corn on the ground ahead of the baboon while it was looking away. Each pile was approximately 10 cm in diameter and placed on the ground 10 cm apart from each other in a randomised left/right position to avoid any left/right preferences (Fig 1). Because trials could still be interrupted subsequently by other troop members, the same colour/palatability combination was used for all members of the same troop. All individuals received five “test” trials, each separated by intervals of three days, i.e., after the first trial (day 0), individuals were tested on days 3, 6, 9 and 12. If it was not possible to test particular individuals on the assigned day, they were tested the next possible day (re-test interval (days): mean 3.7; median 3.0). Individuals who were tested fewer than four times (n = 1) were not considered for this analysis, with the exception of those tested in the month of May (n = 9: 2 adult females, 3 adult males, 2 juvenile females, 2 juvenile males), who could only be tested in three trials due to logistical complications. All tests were conducted between sunrise (0616–0632 during the testing period) and 1000 (mean testing time: 0745) to control for motivation, as individuals are more likely to have similar levels of hunger earlier in the day. A trial was considered as finished when the individual being tested consumed both piles of kernels in their entirety, walked a minimum of 2 m away from uneaten kernels, or were interrupted by conspecifics whilst they were at the task. We did not test any individual who interrupted a trial and ate from either of the corn piles to avoid the confounding effects of previous experience. We took particular care that those animals that were being tested did not observe or interrupt any of their conspecifics’ evaluation until after their set of trials had finished. All experiments were filmed (Canon Vixia HF R300) to facilitate data extraction.

Fig 1

Images of the baboons after being presented with both piles of coloured corn kernels.

Caption: Shown are (A) an adult male sitting down whilst eating the corn kernels, presented on a rocky surface and (B) an adult female bending over to eat the corn kernels, presented on sand.

Images of the baboons after being presented with both piles of coloured corn kernels.

Caption: Shown are (A) an adult male sitting down whilst eating the corn kernels, presented on a rocky surface and (B) an adult female bending over to eat the corn kernels, presented on sand. With the exception of three trials for which we were were entirely unable to extract data due to camera malfunction, the following data were obtained from the videos for each trial: (1) the colour of every kernel consumed; (2) the time spent eating each pile of kernels; (3) how many kernels were left (if any) from each pile; (4) the frequency of vigilance, measured as the number of times the individual scanned its surroundings by either lifting their head or noticeably moving their eyesight from the task; and (5) the total time dedicated to the task. While obtaining data from each video, we identified three potential sources of ambiguity, which we defined and addressed as follows: (1) when individuals’ bodies blocked the task out of sight we noted the arm movement to control for the number of kernel being consumed, and coded those choices as missing values; (2) when a trial was cut short due to camera malfunction, individuals’ first choice was noted and, depending on whether the malfunction occurred immediately at the start of the trial (i.e. before the 10 first kernels were consumed), that trial was not considered for further analysis; and (3) when individuals were joined by their kin (e.g. infants) during a trial, we took into account only those kernels chosen and consumed by the target animal, subtracting from the final amount the kernels consumed by their kin (if any). If the target individual moved away from the task immediately after being joined by their kin, we considered that trial as completed.

Statistical analysis

All analyses were conducted in R (version 3.2.3, 2015). To test each of our proposed learning processes and their relationships to individual phenotype, we evaluated task performance in three ways which corresponded to the proposed learning processes, respectively. First, using kernel choice (binomial, Palatable, 1; Unpalatable, 0), we investigated every choice of kernel (1–40 kernels) in trial 1 to evaluate whether individuals were capable of learning the association rapidly, within a single presentation, after which they consistently avoided the unpalatable option in subsequent trials (the latter is tested in the subsequent models for trials 2–5) (learning process 1). Second, in a similar manner, we investigated every kernel choice (1–40 kernels) for trials 2–5, using separate models in each case, to test whether learning occurs independently in each presentation (learning process 2). Third, we used the proportion of correct kernels in the first 20 kernels eaten in trials 2–5 (numeric, 0–1) to test whether learning occurred gradually across trials (trial 1 was excluded as across-trial learning would only be evident in subsequent presentations, see below) (learning process 3). To be able to learn the association between colour and palatability, the test subjects had to taste both types of kernel. We predicted that the baboons would have this opportunity by sampling both options at the beginning of the trial. However, this was often not the case, as the animals frequently “bulk” fed, eating one pile of corn entirely before switching to the next pile (Table 1). Consequently, we limited our analyses of within-trial learning (learning processes 1, 2) to those individuals that ate from both piles within the first 20 kernels (Table 1, first data column), i.e. all cases where individuals did not bulk-feed. However, for our analysis of across-trial learning (learning process 3), we used the proportion of correct kernels within the first 20 kernels in trials 2–5, as we assumed that even when individuals bulk-fed they would still acquire information about both piles of kernels by the end of trial 1 (provided they fed from both, which they did), which could then be applied in subsequent presentations.

Table 1

Feeding patterns for the two piles of kernels in each trial.

Trial	Switch between piles within first 20 kernels	“Bulk feeding”: palatable to unpalatable	“Bulk feeding”: unpalatable to palatable	Interruptions
1	13	13	5	3
2	15	10	7	5
3	14	11	6	7
4	7	9	9	3
5	12	4	2	4

Feeding patterns for the two piles of kernels in each trial.

The number of individuals and the feeding pattern adopted in each trial. The feeding patterns included individuals who: (i) switched between both piles presented within the first 20 kernels; (ii) “bulk” fed eating the palatable pile of kernels in its entirety before switching to the unpalatable one; and (iii) “bulk” fed eating the unpalatable pile of kernels in its entirety before switching to the palatable one. Also reported are the number of trials that were interrupted before individuals could sample both piles of kernels. We used generalised linear mixed-effects models (GLMMs) [package “lme4”: 67] with a logit link function to account for binomial error structure to assess the effects of phenotype on task performance. Individual identity was included as a random effect in all models. To facilitate convergence, quantitative predictor variables were z-transformed to have a mean of zero and a standard deviation of 1. Preliminary analyses showed no co-variances where correlation was >0.70 between any of the fixed effects (S4 Appendix). Nevertheless, we tested each model for multicollinearity using variance inflated factors (VIFs) [package “usdm”: 68]. As some of the fixed effects had a VIF of >2.0, we did a stepwise selection from the main model until all remaining variables had VIFs <2.0. To avoid overparameterisation, backwards elimination of non-significant terms was used, until a minimal model was obtained after which eliminated variables were then added back to the final model to check they remained non-significant. We describe each of the models in turn below.

Learning process 1: Rapid learning in trial 1

The analysis evaluating kernel choices within the first trial consisted of a model that addressed our questions about (a) how individuals learnt and (b) individual characteristics associated with variation in learning. As the response variable, the model (MT1) included every kernel choice made (Palatable, 1; Unpalatable, 0) in this trial. To test for learning, we included the kernel number as a fixed effect. We predicted that learning would be demonstrated by a positive association between kernel number and the probability of consuming a palatable kernel. Additionally, we included interactions between kernel number and the sex, age, social rank and the neophilia level of individuals. A significant interaction with any of these variables would provide evidence of phenotypic trait/state-dependent learning differences. Three further fixed effects were also included: (1) individuals’ first choice of kernel in that trial (Palatable, P; Unpalatable, U), to control for those individuals that may have found it more difficult to detect a palatable kernel when tasting the bitter kernels first; (2) troop identity, to control for the possibility that baboons have an innate preference for a particular food colour; and (3) the probability of randomly selecting a palatable kernel at each choice, to account for the change in the proportion of palatable choices available as the trial progressed. This final variable was calculated as the proportion of remaining kernels that were the “palatable” choice such that at the start of the trial this proportion was 0.50 (20 of 40 kernels), and was subsequently updated with each choice that was made until no choice was available (i.e. one pile had been consumed in its entirety). When individuals consumed a pile in its entirety, they were no longer able to choose between the two options; as such, we did not consider individuals’ choices after all the kernels of one pile were eaten. When trials were interrupted or individuals left kernels uneaten, a missing value was assigned to the remaining choices that were no longer possible to make. Evidence for the first learning process, that individuals learnt the association in the first trial and remembered the association, would involve not only a positive relationship between kernel choice and number in this model but also consistently palatable choices in the subsequent trials, which are analysed in the next set of models (see below).

Learning process 2: Repeated rapid learning in trials 2–5

To test whether individuals re-learnt the association in each trial, we fitted four further models following the same model structure outlined in MT1 above for each of the subsequent trials 2–5 (MT2, MT3, MT4 & MT5).

Learning process 3: Gradual learning across trials

Gradual learning may be identified by an increasing number of palatable kernels eaten across trials, until only the palatable kernels are eaten at the start of a trial. We therefore analysed the proportion of palatable kernels eaten of the first 20 kernels in trials 2–5, as this amounts to the quantity of one pile of kernels. Trial 1 was not analysed in this sample, as this was the initial learning opportunity. The fixed effects in this model (model MT2-5) comprised trial number, to assess whether there was evidence for gradual learning; individual traits/states and their interactions with trial number, to assess whether gradual learning was predicted by individuals’ phenotypic traits/states; and two of the same additional fixed effects as in the preceding models, i.e. an individual’s first choice of kernels (in the first trial) and troop membership. In this model, we also evaluated past vigilance behaviour as a predictor, to test whether learning was negatively affected by an individual’s attention being diverted from the task. This required the inclusion of two additional fixed effects in the model, the number of vigilance instances observed in the previous trial (i.e. the vigilant behaviour of trial 1 was used as a predictor of trial 2) and the total time of each corresponding trial.

Interpretation of within-trial findings

Throughout the course of the experiment, the baboons adopted two unanticipated behaviours that complicated the interpretation of the within-trial learning results: bulk-feeding and consumption of the unpalatable kernels. Bulk-feeding resulted not only in reduced sampling of the available options at the start of the trials but also in reduced switching between options within trials, even if individuals had sampled both options within the first half of the trials. This affected the probability of choosing a palatable kernel as the trial progressed as one option was depleted continuously for an extended number of choices, even if it was the unpalatable choice. In addition, consumption of the unpalatable kernels meant that learning could be masked in the analyses. This is because individuals who chose the correct kernels first and then switched to the unpalatable kernels would, counterintuitively, show a negative probability of choosing the palatable kernels as the trial progressed, even if they had learnt the colour association and as a result were choosing the palatable option first. Because these unanticipated behaviours complicated the interpretation of the results in ways that were difficult to predict intuitively, we ran simple post-hoc scenarios of the possible outcomes (i.e. the observed relationships between kernel choice and both kernel number and the probability of randomly choosing the palatable kernel) that included the options of bulk-feeding and consumption of the unpalatable kernels to determine how we might still identify learning under these circumstances. We plotted five scenarios of the different within-trial processes, two that assumed within-trial learning and three that assumed no learning: (1) fast learning at the start of the trial; (2) slow learning throughout the trial; (3) no learning (without bulk feeding); (4) bulk-feeding on the palatable kernels at the start of the trial; and (5) bulk-feeding on the unpalatable kernels at the start of the trial. Each scenario generated a visual plot against which our observed data were compared. In the first case, individuals alternately sampled two of each kernel to learn the association and, starting from the fifth kernel, then ate the remaining palatable kernels before switching to the unpalatable kernels. In the second case, individuals started in the same manner as for fast learning, but progressively ate more of the palatable kernels while still intermittently sampling 1–3 unpalatable kernels until no correct kernels remained and the unpalatable kernels were then consumed. In the third case, the choice was random. In the fourth and fifth cases, individual sampled one unpalatable or palatable kernel, respectively, first, before bulk-feeding on the other option. All scenarios were based on observations of feeding patterns in those trials in which individuals switched between piles. Scenarios were not probabilistic and as such there is only one outcome for each scenario as described. For each scenario, we plotted the proportion of palatable kernels eaten relative to (1) the kernel number and (2) the probability of choosing the palatable kernel given the proportion of palatable kernels that remained (Fig 2). We used these plots to generate expectations of what the data would look like under each scenario against which we could compare our observed results for both trials 1 (learning process 1) and trials 2, 3, 4, and 5 (learning process 2) (Table 2).

Fig 2

Plots of the possible learning scenarios within each trial.

Caption: Shown are the relationships for the learning scenarios for: (a, b) fast learning; (c, d) slow learning; (e, f) no learning; (g, h) palatable bulk-feeding; and (i, j) unpalatable bulk-feeding. Each pair of plots shows the relationship between the proportion of correct kernels eaten and either the kernel number (left plot, in orange) or the probability of choosing a correct kernel (right plot, in green). Kernel number indicates the proportion of palatable kernels eaten in groups of five. The probability of randomly choosing the palatable kernels shown in each plot represents the median probability of palatable choices in groups of five. Note that the number of points is contingent upon how quickly the baboons completely consumed one of the piles, after which no choice was possible, and the scenario ended. Differences in the x and y-axis reflect the differences in the proportion of palatable kernels and probability based on the learning scenario proposed.

Table 2

Possible within-trial learning scenarios and simulated outcomes.

The proposed learning scenarios of the within-trial learning process and the expected directions of the effect for the fixed effects of kernel number and the probability of making the palatable choice.

Proposed scenario	Predicted effect of kernel number on the response	Predicted effect of probability of randomly choosing the palatable choice on the response
Fast learning	Positive (weak)	Negative (weak)
Slow learning	Positive (strong)	Negative (strong)
No learning	None	None
Bulk feed on palatable kernels	Positive (very weak)/none	Negative (very weak)/none
Bulk feed on unpalatable kernels	Negative/none	Negative/none

Plots of the possible learning scenarios within each trial.

Possible within-trial learning scenarios and simulated outcomes.

Ethics statement

Our research and protocols were assessed and approved by the Ethics Committee of the Zoological Society of London (BPE 727) and approved by the Ministry of Environment and Tourism in Namibia (Research Permit 2009/2015). We contacted private and public landowners directly and were given full permission to work on their land throughout the field season.

Results

We tested 38 individuals over 162 trials overall (mean number of presentations = 4.3; median = 5; range 3–5). In total, 43 trials (26%) were interrupted by displacements or supplants by more dominant animals, while 22 (13%) interruptions happened before animals could sample both piles of kernels. None of the individuals tested in each trial subset were interrupted before they had sampled both options. Trials lasted on average 15 s (range = 1–545). Compared to the pilot study where 64% of individuals left unpalatable kernels uneaten in the single presentation (see Experimental Procedure), in the current experiment only 16% of individuals (6 of 38) left unpalatable kernels uneaten in the first presentation (X = 10.38, p = 0.001). Across all uninterrupted trials (119), the baboons consumed a median of 11 palatable kernels in the first 20 kernels (range 0–20) and ate both piles of corn in their entirety in 79 (48%) uninterrupted trials. In the remaining trials, the median number of palatable kernels remaining was 15 (range 10–20), in comparison to a median number of 18.5 unpalatable kernels (range 8–20). These patterns suggest little discrimination between the palatable and unpalatable kernels. A similar pattern was seen at the individual level: among the 12 individuals who completed five uninterrupted trials, on 36 out of 47 occasions (77%) all the corn was consumed irrespective of palatability. On average, animals had 4 instances of vigilant behaviour (range = 0–18) per trial. Sniffing behaviour was observed in 21% of trials; however, preliminary tests showed no relation with the choice of kernels eaten in each trial, nor the proportion eaten across trials. In our within-trial analyses, we found no consistent evidence of associative learning in either the first trial (rapid learning, process 1) or the subsequent four trials (repeated rapid learning, process 2), due to the lack of a consistent positive relationship between correct kernel choice and kernel number (Table 3). A comparison of our results with our within-trial learning scenarios suggests our findings are more consistent with individuals using a bulk-feeding pattern after sampling with no learning (Table 4). The patterns are not consistent between trials; however, our results show baboons favoured bulk-feeding from the unpalatable pile of kernels in three out of five trials; while in only one out of five they favoured bulk-feeding on the palatable kernels. Results in the last trial (MT5) were inconsistent with any of the five learning scenarios tested. A plot of the raw data for kernel choice over time in each of the five trials (which does not control for changes in the availability of palatable kernels over time) is provided for illustrative purposes in Fig 3. Regarding evidence of across-trial learning (Process 3), this analysis only yielded a significant relationship between the proportion of palatable kernels eaten and troop membership (MT2-5, Table 3, Fig 4), where individuals in L troop were more likely to eat a higher proportion of unpalatable kernels across trials than individuals in J troop. There was no evidence of learning across trials, i.e. the proportion of palatable kernels eaten did not increase in later trials.

Table 3

Predictors of performance in wild chacma baboons in an associative learning task with coloured corn.

The three learning processes are evaluated in turn (divided by the bold line). Each model is listed by name (see text for details) and the learning process being tested; the response variable; the number of observations and individuals; the deviance; and the fixed effects of the minimal models, with their effect sizes and standard errors (coefficient and its equivalent probability in parenthesis, S.E.), test statistic (t) and p-values. Significant results with values of p < 0.05 are highlighted in bold. 1Reference category: J troop. 2Reference category: Palatable kernels.

Model	Learning Process	Response	Nobs/ Nind	Deviance	Term	Coefficient	S.E.	t	p
M_T1	1: Rapid Learning	Kernel choices in Trial 1	374 / 13	476.6	Intercept	1.02 (0.73)	0.28	3.60
M_T1	1: Rapid Learning	Kernel choices in Trial 1	374 / 13	476.6	Kernel Number	-0.02 (0.49)	0.01	-2.15	0.03
M_T2	2: Repeated Rapid Learning	Kernel Choices in Trial 2	303 / 15	287.1	Intercept	2.33 (0.91)	0.62	3.70
					Kernel Number	-0.03 (0.49)	0.02	-1.47	0.13
					Neophilia	-0.65 (0.34)	0.44	-1.46	0.14
					Troop: L¹	-1.78 (0.14)	0.79	-2.25	0.02
					Probability of correct choice	-0.69 (0.33)	0.39	-1.77	0.07
					K. Number*Neophilia	0.07 (0.51)	0.02	3.24	0.001
M_T3	2: Repeated Rapid Learning	Kernel Choices in Trial 3	327 / 14	332.9	Intercept	-1.67 (0.15)	0.65	-2.55
					Kernel Number	0.12 (0.52)	0.02	5.87	<0.001
					Troop: L¹	1.55 (0.82)	0.85	1.82	0.06
M_T4	2: Repeated Rapid Learning	Kernel Choices in Trial 4	134 / 7	141.7	Intercept	1.91 (0.87)	0.51	3.70
M_T4	2: Repeated Rapid Learning	Kernel Choices in Trial 4	134 / 7	141.7	Probability of correct choice	-2.32 (0.08)	0.67	-3.46	<0.001
M_T5	2: Repeated Rapid Learning	Kernel Choices in Trial 5	180 / 12	173.6	Intercept	-4.98 (0.006)	1.87	-2.66
					Kernel Number	0.71 (0.67)	0.19	3.73	<0.001
					First Choice: U²	3.56 (0.97)	1.79	1.98	0.04
					Probability of correct choice	12.57 (0.99)	3.35	-3.75	<0.001
M_T2-5	3: Across-trial Learning	Proportion of correct kernels in Trials 2–5	111 / 38	509.8	Intercept	2.75 (0.93)	0.84	3.26
M_T2-5	3: Across-trial Learning	Proportion of correct kernels in Trials 2–5	111 / 38	509.8	Troop: L	-2.37 (0.08)	1.17	-2.02	0.04

Table 4

Observed learning scenarios within each trial.

	Observed effect of kernel number on the response (from Table 3)	Observed effect of probability of a correct choice on the response (from Table 3)	Best fit scenario (from Table 2)
Trial 1 (M_T1)	Negative	None	Bulk feed on unpalatable kernels
Trial 2 (M_T2)	Negative	Negative	Bulk feed on unpalatable kernels*
Trial 3 (M_T3)	Positive	None	Bulk feed on palatable kernels
Trial 4 (M_T4)	None	Negative	Bulk feed on unpalatable kernels
Trial 5 (M_T5)	Positive	Positive	?

Fig 3

Plot of the relationship between the proportions of correct kernels chosen and kernel number in trials 1–5.

Caption: Bar-plot of the proportions (mean ± S.E.) of palatable kernels chosen in the first (1–10), second (11–20), third (21–30) and fourth ten (31–40) kernels consumed in (a) trial 1; (b) trial 2; (c) trial 3; (d) trial 4; and (e) trial 5. Each plot shows the raw data and therefore does not control for the changing availability of palatable kernels. The proportion of palatable kernels (0–1) in each plot was calculated as the number of palatable choices in every group of ten kernels divided by ten. If a trial was interrupted before consuming the fifth palatable kernel within a given set of ten, those data were omitted from the plot. However if the trial was interrupted when the minimum of five palatable kernels or more within a given set of ten had been consumed, the final number of palatable kernels within that set was divided by the total number of kernels in that set and included in the plot. Standard error bars are shown for each column in each plot.

Fig 4

The relationship between troop membership and the proportion of correct kernels eaten within the first 20 choices across trials 2–5.

Caption: Box-and-whisker plot of the proportions of palatable kernels chosen within the first 20 choices for each troop. Cases in which individuals who did not consume at least 20 kernels were not included in the sample analysed. The horizontal line in each box indicates the median, the box shows the lower (25%) and upper (75%) quartiles of the data, and the whiskers the minimum and maximum values. Black dots represent individual values.

Plot of the relationship between the proportions of correct kernels chosen and kernel number in trials 1–5.

The relationship between troop membership and the proportion of correct kernels eaten within the first 20 choices across trials 2–5.

Predictors of performance in wild chacma baboons in an associative learning task with coloured corn.

Observed learning scenarios within each trial.

Caption: The table shows the observed relationships in the GLMMs (from Table 3) and the inferred learning scenarios based on these relationships (from Table 2). Shown are: the trials and corresponding models; the direction of the observed main effect of kernel number on kernel choice; the direction of the observed main effect of probability to choose the palatable kernel and kernel choice; and the scenario that best fits the observed data based on the predictions generated from the learning scenarios. A question mark (?) indicates that the results obtained did not match any of the simulated learning scenarios. *The interpretation of this learning scenario is complicated due to the presence of a significant interaction in the minimum model. For the purposes of comparison with the simulation output, we consider the direction of the estimate of the main effects only. Lastly, we found little evidence of between-individual differences in task performance. There was only one model in which an effect of phenotype was detected: an interaction between neophilia and kernel number was found to influence kernel choice in trial 2 (MT2, , Fig 5), where individuals of high and low neophilia differed in their relative consumption of palatable kernels in the middle of each trial.

Fig 5

The interaction between kernel number and neophilia on kernel choice in trial 2.

Caption: Plot of the proportion (mean ± S.E.) of palatable kernels chosen in the first (1–10), second (11–20), third (21–30) and fourth ten (31–40) kernels consumed in trial 2 and the level of neophilia of individuals (High/Low). The proportion of palatable kernels (0–1) was calculated as the number of palatable choices in every group of ten kernels divided by ten. If a trial was interrupted before consuming the fifth palatable kernel within a given set of ten, those data were omitted from the plot. However if the trial was interrupted when the minimum of five palatable kernels or more within a given set of ten had been consumed, the final number of palatable kernels within that set was divided by the total number of kernels in that set and included in the plot. This plot shows the raw data and therefore does not control for the changing availability of palatable kernels. The whiskers shown on each dot represent the minimum and maximum values. For visualization purposes, neophilia level was grouped evenly into two categories, high and low according to the median value. Black circles and triangles represent individual values for high and low neophilia respectively.

The interaction between kernel number and neophilia on kernel choice in trial 2.

Discussion

We tested the associative learning abilities of individuals belonging to two groups of wild baboons with an operant conditioning task involving an association between the colour and taste of corn kernels (red/green, palatable/unpalatable) over five trials. We expected that all individuals would show an improvement in task performance as they learned the colour-taste association either within or across trials, and that certain phenotypes would show faster learning than others. However, we did not find support for any of our predictions. Overall, our results suggest that individuals bulk-fed without learning on either the palatable or unpalatable kernels in most trials. Additionally, we found limited evidence of individual differences in learning, as there was only a single significant interaction between kernel number and neophilia, where individuals with lower levels of neophilia were more likely to eat the correct kernels at the start of that trial. We also found that troop membership determined the likelihood of eating a higher proportion of correct kernels across trials. An animal’s fitness may depend on its ability to make rapid associations regarding novel foods [69], as animals must not only determine the safety of those foods but also whether they are nutritionally rewarding [70,71]. Previous studies with chacma baboons provided further support to our expectation that baboons would concisely and rapidly learn the taste-colour association. For example, captive baboons were able to solve complex learning tasks using an automated device [72], while wild baboons are capable of rapidly learning the location of valuable food items [e.g. 73,74]. Although we found no evidence for learning, it is possible that the baboons learnt there was a difference in taste between both piles of kernels (i.e. one was bitter while the other was not) but were still willing to eat the bitter kernels. This interpretation is supported by our observation that the baboons left slightly more bitter kernels uneaten. Generalist species such as baboons can adapt quite successfully to situations involving novel-flavoured foods [75]. This may be in part because they have relatively low gustatory sensitivity and can readily incorporate novel foods into their diet even when these are unpalatable to other species [76]. Captive studies indicate that tolerance to bitterness varies among primate species, particularly to compounds not found in nature, such as the bitter substance used here (denatonium benzoate) [77]. An additional aspect in this study is that it was conducted during a severe drought year, which likely further influenced the test subjects’ willingness to accept the bitter food presented. Chacma baboons elsewhere in the Namib Desert have been reported to feed on unpalatable toxic plants that they would normally avoid, such as Euphorbia avas-montana, during drought conditions [78]. The observation that the baboons showed a stronger rejection of the bitter kernels during the pilot study, when environmental conditions were less severe than during the main study conducted five months later, supports this interpretation. Additionally, it is possible that the acceptance of bitterness was reinforced by the baboons’ previous experience with kernels, which represented a nutritious and safe source of food. Our analyses of learning across trials revealed that individuals from J troop, in which the palatable kernels were red, tended to eat a higher proportion of these kernels within their first 20 choices than their L troop conspecifics. Such a result may be indicative of a species’ selective attention towards red food items. The ripening of fruit from green to red correlates with changes in their glucose levels, indicating their taste quality and nutritional value [31]. Moreover, while the normal diet of both troops includes large amounts of immature green pods, leaves and stalks, individuals who were tested could have had a natural preference for the red kernels as this colour represents a key seasonal fruit in this environment: the ripe winter berries of Salvadora persica. Since we were avoiding any potential habituation or possible generalisation between colours, we did not test for a specific colour preference to exclude; however, as both colours used in this task represented food items frequently eaten, and thus, ecologically relevant to learn, it was not obvious at the time that the baboons would have a preference for either colour. The fact that this result was only inconsistently observed within trials suggests that the preference may be a relatively weak one (perhaps unsurprisingly, given foods of both colour occur in the natural diet), and was therefore best captured by sampling individual preferences over the larger range of choices analysed in the across-trials model. Some unexpected patterns were observed in the within-trial analysis, in particular, the failure of the results from trial 5 to correspond with any of the simulated learning scenarios, and the interaction between neophilia and kernel number that was contrary to prediction in trial 2. In the first case, the characteristics of the subset may be responsible: an unusually high proportion of trials were interrupted in this final trial (7 out of 15 trials) and early and late interruptions (i.e. within the first 10 and 20 kernels respectively) were not considered in the learning scenarios. In the second case, we found that more neophilic and less neophilic individuals differed in their relative consumption of correct kernels in the middle of this trial. However, because this was only observed in a single trial, and after a prior presentation, it is difficult to interpret. Given the number of interactions we tested, and our decision not to apply Bonferroni corrections [79], the most parsimonious interpretation may be that this result represents a spurious relationship. Our experience in developing and conducting this study reflects some of the challenges involved in devising cognitive tasks that efficiently assess cognitive abilities and suit the species under study, particularly in wild conditions [80,81]. The pilot tests we conducted saw a majority of individuals avoiding the maize after tasting the bitter kernels, suggesting they were sensitive to such a substance. However, it is possible that differences in the abundance of natural foods during the pilot study, which was conducted over a period of summer rains, and during the experiment, which was conducted over a period of winter drought, resulted in the latter differing substantially from what was observed in the former. Despite substantial literature detailing the influence of unfavourable environment on animals’ cognitive abilities [e.g. 82,83] and cue use [e.g. 35], there is little empirical evidence on the phenotypic plasticity of labile traits related to cognition. Consequently, we have a poor understanding of cognitive variability in fluctuating, rather than predictable and constant, environments. Future studies could consider differences in learning behaviour relative to the resources available in the environment at different points in time. Additionally, the unanticipated performance of bulk feeding behaviour made this standard laboratory experiment difficult to interpret. Because the baboons bulk fed from one pile of corn at a time, few individuals sampled both colours of corn at the start of a trial. Because of this, we were forced to analyse a smaller sample, which reduced the statistical power to detect an effect or pattern. Ultimately, our study highlights the importance of using the right task to assess cognitive abilities, taking into account not only the natural behaviour of animals, but also their current environmental conditions to understand how abilities such as associative learning develop in a natural setting.

Demographic breakdown of all identifiable individuals in both study troops.

(DOCX) Click here for additional data file.

The numbers of baboons tested according to sex, age and troop in the study.

Shown are the total numbers of baboons tested of each age-sex class in each troop, including the total. The percentage of the population that the sample represents is presented in brackets. (DOCX) Click here for additional data file.

The numbers of baboons tested according to dominance rank and neophilia level in the study.

Shown are the total numbers of baboons tested of each dominance rank and neophilia level, including the total. The percentage of the population that the sample represents is presented in brackets. For the purposes of this table, dominance ranks were grouped evenly into categories of “low-rank”, “medium-rank” and “high-rank” according to tertiles; while neophilia levels was grouped evenly into categories of “low-neophilia”, “medium-neophilia” and “high-neophilia” according to tertiles. (DOCX) Click here for additional data file.

Spearman rank correlation coefficients of the predictor variables used in the GLMM models.

Shown are the Spearman correlation coefficients of the predictor variables used in the GLMM models. Sample size is N = 38 individuals in all cases. Individual vigilance and total time were calculated as the median across all trials (1–5). First choice refers to the first choice between each pile of corn in the first trial (Palatable, P; Unpalatable, U). (DOCX) Click here for additional data file. 12 Sep 2019 Submitted filename: Reviewer comments_FINAL.docx Click here for additional data file. 25 Nov 2019 PONE-D-19-25653 Exploring individual variation in associative learning abilities through an operant conditioning task in wild baboons PLOS ONE Dear Dr. Carter, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Unfortunately, Reviewer 1 was no longer available, and a new reviewer was invited. Both reviewers found that your study is valuable and that performing cognitive studies in the wild is extremely challenging, and I share their view. However, whereas Reviewer 1 (previously Reviewer 2) was quite happy with your revision, the current Reviewer 2 asked for further work on the paper, before it can be considered acceptable. Please address the comments provided by both reviewers, along with my suggestions, listed below. We would appreciate receiving your revised manuscript by Jan 09 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols Please include the following items when submitting your revised manuscript: A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. We look forward to receiving your revised manuscript. Kind regards, Elsa Addessi Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at http://www.journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and http://www.journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf 2. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide. 3. Please include a copy of Table 2 which you refer to in your text on page 23. Additional Editor Comments (if provided): ll 39-40: please modify as follows: “current environmental challenges associated with conducting cognitive tests of animals in the wild” Ll 494-495 “We also found that troop membership determined the likelihood of eating a higher proportion of correct kernels across trials.”: couldn’t it be done to color preference? Did you test color preference before administering the differently flavored kernels? Although I appreciated the answers you provided in reply to the comments of the previous reviewers on this issue, this aspect should be accounted for in the Discussion. L 529 “and the interaction that was contrary to prediction in trial 2”: please remind the reader what is the interaction you are mentioning here L 533: “that” repeated twice (typo) [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Partly ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: No ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: General comments This study aimed to assess associative learning performance of individuals in two groups of wild baboons. The manuscript represents a significant amount of work and while it is unfortunate that the authors were not able to detect learning, I believe the trial was thoughtfully conducted. There remains a paucity of studies investigating cognition in wild populations (it is challenging!) and studies such as the current one are useful building blocks for future attempts. The authors have done a nice of addressing the concerns/ suggestions raised in the previous two reviews, as such I only have minor comments. Minor comments: 1. L51 – As there are only a small number of studies that have looked at the fitness consequences of associative learning (adaptive value) in natural environments, it would be good to cite them all here. Also, as the authors correctly point out, differences in learning are likely to reflect adaptions. The only study currently cited reports a positive correlation between learning performance and fitness correlates, but two other studies report negative relationships: Ashton B.J., Ridley A.R., Edwards E.K., Thornton A. 2018 Cognitive performance is linked to group size and affects fitness in Australian magpies. Nature 554 (7692), 364-367 (doi:10.1038/nature25503). Madden JR, Langley EJG, Whiteside MA, Beardsworth CE, van Horik JO (2018) The quick are the dead: pheasants that are slow to reverse a learned association survive for longer in the wild. Philosophical Transactions of the Royal Society B: Biological Sciences 373 Evans LJ, Smith KE, Raine NE (2017) Fast learning in free-foraging bumble bees is negatively correlated with lifetime resource collection. Scientific Reports 7:496 2. L52 – This sentence should be modified to improve clarity. I suggest inserting ‘while ultimately differences in…’ removing also from ‘individuals within a species also differ…’ and including ‘genotype or epigentic changes due to the developmental trajectory environment experienced during their lifetime. 3. L85 – Incorrect references inserted, should be references ‘3’ (bumble bees) and ‘4’ (humingbirds). It doesn’t seem as though reference ‘5’ has been used at all – so it would pay to double check all references align as intended. 4. L86 – Rather than including a second reference for bumble bees can you reference a study that has looked at associative colour learning in foraging mammals? Or a least a different taxon e.g. butterflies, jumping spiders. 5. Discussion – it would be good to briefly comment on baboon cognition generally. What would you expect to see in terms of their associative learning performance, based on their performance in other cognition assays, conducted either with captive or wild individuals? 6. L552 - It would be good to reinforce that studying cognition in wild populations is generally very challenging by citing the below studies: Huebner F, Fichtel C, Kappeler PM (2018) Linking cognition with fitness in a wild primate: fitness correlates of problem-solving performance and spatial learning ability. Philosophical Transactions of the Royal Society B: Biological Sciences 373 Morand-Ferron J., Cole E.F., Quinn J.L. 2016 Studying the evolutionary ecology of cognition in the wild: a review of practical and conceptual challenges. Biological Reviews 91(2), 367-389. (doi:10.1111/brv.12174). Reviewer #2: Note: I have not reviewed the original version of this manuscript submitted to PLOS ONE. This is an interesting study on individual differences in wild baboons’ abilities to learn a two-choice color/food discrimination. I applaud the authors for their careful efforts to conduct individual cognitive tests with wild, group-living animals. Although it is not clear what the baboons had learned, the finding that even seemingly simple behavior is strongly influenced or constrained by animals’ natural environment is important. Indeed, such factors should be more frequently considered in studies of captive animals as well. However, I have some concerns and suggestions about how to effectively present and analyze the data. INTRODUCTION This section provides a nice exposition but the motivation for the study is not obvious to the reader. Yes, we know little about this topic in the wild, but you could make clearer why people should want to study learning (and between- and within-species variability) in the first place. There is a gap, but why do we want to fill it? What we can learn from this more broadly? Lines 69-72: I’d add that, on the other hand, captive animals may also “fail” on problems they’d never encounter in the wild, which can distort our estimate of that species’ cognitive abilities in the other direction. Lines 111-113: Couldn’t you argue the opposite, that males would need to learn faster because they encounter new environments? But in general, how stable are these baboons’ environments outside of seasonal variability? Even if they disperse and go to a neighboring group, aren’t they still living in the same general habitat with the same food sources? Line 122: “because of their greater experience” I don’t fully follow this point. Greater experience with what? With that particular food item, with learning associations, something else ..? Lines 134-135: How can they be independent and considered opposites? Line 140-141: Why? If the idea is that neophilic individuals spend more time with the task (or would perhaps be less vigilant?) and therefore are more likely to learn, I suggest adding this explicitly – and then since you have the data, you should also test it statistically. Line 142: Please add a brief definition for vigilance. METHODS The authors collected a substantive data set including food choice in the learning task and individual differences in sex, rank, neophilia, age, and vigilance in prior trials. However, several reasons make it difficult to fully appreciate this data set. There are no indicators of reliability (except for a previous repeatability estimate for neophilia, which seems low to rely on a single measurement here). Line 221: When trials were interrupted, did that always end the trials? If so, please state. In lines 312-313, you say that a trial “was considered finished” when all kernels of one pile were eaten. 1) This information needs to be much earlier (move it into the Experimental Procedure section). 2) Did the experimenter remove the other pile at this point? Or was it just considered finished for data recording, but the baboon could still eat the other pile? Line 243: “(2) the colour of the first ten kernels consumed” – should this really be 10? not “all”? Lines 265-267: I’d add that this means all cases where they didn’t eat one pile in its entirety (20 kernels) first. It’s implied from having 20 kernels in each pile and you say this later in the manuscript, but it would be helpful for the reader here too. Table 1: The sums by row sum to trial 1-5: 34, 37, 38, 38, 32. Why can the number of individuals for trial 1 be lower than subsequent trials? And if the grand total here is 179 and interruptions were 22, does that not contradict the descriptives in lines 400-401 and 531? Table 1: But what is the proportion when they switched within the first 20 trials? E.g., 19 vs. 1 would still essentially be bulk feeding. The cutoff could be arbitrary, of course, but the distribution seems important because you use this criterion to subset your data. The number of observations is extremely variable, in part due to the natural distribution of traits in this population, in part due to practical constraints during data collection, and in part due to subsetting for statistical analyses. This is not a problem per se but could be presented in a more organized fashion. The eventual reader could consult the full data set, but making it available for review as well would be extremely helpful in evaluating the analytical approach. Line 284: You tested for multicollinearity but please add something like a correlation matrix or crosstabs to indicate which of these factors covaried. That’s interesting in its own right but also important to assess your models. Appendix S3: How/why were these divided into low, medium, and high? Aren’t these both continuous variables? --Models-- The analytical approach is confusing. Couldn’t you combine Models T1, T2, …, and T5 into a single model and add trial number as a fixed effect and with interactions, like in Model T2-5? If they only learned in trial 1 and then only chose palatable trials, you’d see an interaction between trial and kernel number. If they relearned the association every time (at the same rate), you wouldn’t expect an interaction. I also still don’t understand why you couldn’t use the full data set here too? Including the current proportion of palatable/unpalatable kernels as a fixed effect seems odd because it’s itself dependent on prior choices, which are the measure of interest. I understand the motivation to account for this change in proportion, but this seems statistically unsound. --Simulations-- I like this approach a lot (running simulations of the proposed processes to see what the data should look like), but this needs some work. 1. Why not also do this for between-trial learning? 2. Please provide more detail about how the simulations work. E.g., how many runs per scenario? What were the probabilities? Scenario 1: What kernel number is meant by “then” (line 362)? Scenario 2: What precisely is meant by “intermittently” (line 365) and how was it determined whether 1, 2, or 3 kernels were sampled? 3. Does “baseline probability” mean the ratio of remaining palatable/(palatable + unpalatable)? If so, “baseline” may not be the best word because it suggests the initial probability (i.e., 50%) and could therefore cause confusion. 4. Importantly, if you simulate these processes for 38 baboons, and run the same analyses that you run on the actual data (minus individual variability), can you extract these patterns? It’s not clear whether that is how you derived Table 2 (if so, state so explicitly and add the actual parameter estimates). Given the multicollinearity concerns and criticisms of stepwise regressions, I agree with the previous Reviewer that a model selection/information theoretic approach might be more appropriate (Burnham & Anderson, 1998). It also lets you avoid overfitting, but in an arguably more principled manner (specifically, by including a penalty term for each added parameter). RESULTS Some descriptives (medians, ranges) for neophilia, vigilance, time spent on the task would be nice. Lines 405-406: How many kernels did they leave uneaten in subsequent trials? Table 3: This might change depending on changes to your analytical approach, but it’s hard to extract what probabilities/odds ratios for kernel choice these models actually predict. This would be much aided by a plot with the model fit overlaid (see also below). FIGURES The current figures do not convey a lot of information and only in aggregated form. And the arguably most useful figure (S4) to get a quick sense of the baboons’ behavior in the learning task is hidden in the supplemental material. The figures could be used to much greater effect to include e.g. a line + uncertainty band to indicate model fit, the number of observations (e.g., as labels or proportional to point size) or individual points/trajectories (e.g., in semi-transparent, small points or thin lines) having Fig. S4 in the same format as Fig. 2 (simulations) would also really help comparison to the different learning processes. Fig. 2 should have the same x- and y-axis scales throughout, to aid comparison (i.e., kernel size always from 1-5 to 36-40, baseline probability always from 0 to 1, proportion palatable always from 0 to 1). Fig. 4 should have ranges as the labels (i.e., 1-10 instead of “10” etc.), also make the y go from 0 to 1 Fig. 4 how were low and medium determined? Also, isn’t this a continuous variable? If the categorization was just for visualization purposes, please state so in the caption. DISCUSSION Assuming that the general finding holds (no evidence of learning as measured by [exclusive] preference of palatable kernels, let alone individual differences), this seems fine. I appreciate the discussion of how things can go awry in the field. Two minor points to perhaps discuss in more depth: The novel food wasn’t really novel, as the baboons had all eaten corn before. Just the color and bitter taste were new. To what extent do you think their previous experience/learning explains why they didn’t learn (more) here? If they already knew that it was nutritious and not poisonous, they may well have learned that one tasted bitter, but that might have simply not been enough to outweigh a free meal (especially when food is scarce, as you mention). Is there any other evidence in the literature that individual differences play less of a role during times of food scarcity because everybody is similarly limited in what they can do? If so, that may be worth including. TYPOS, WORDING SUGGESTIONS, & REFERENCES: The use of “incorrect/correct” kernels sounds a bit awkward and seems unnecessary. I suggest simply using “unpalatable/palatable” throughout the manuscript. Please carefully check your references. Some refs listed in the list are not listed in the text (e.g., [5, 36, 63, 64]) and refs [58] and [59] are identical. Line 35: should be “individuals’ phenotypes” (with an s) Line 47: should be “aspects such as foraging behavior” (add “as”) Lines 90, 93, 95: I’d recommend italics instead of all-caps to emphasize the either/or’s Line 110: better “female baboons would” (instead of “will”) Line 203: should be “prior experience with corn kernels” (instead of “of”) Line 388: should be Table 2 (instead of 1) ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step. 30 Jan 2020 Response to reviewers We would like to thank the Academic Editor and two anonymous reviewers for their helpful comments and suggestions. Our responses below are presented on a point-by-point basis (and included in a file attached). When relevant, we include the line numbers that correspond to changes made in the manuscript version without track changes. General comments 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at http://www.journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and http://www.journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf This has been checked 2. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide. We thank the Editor for pointing this out. Our data has been uploaded to the Figshare repository (10.6084/m9.figshare.9785867). 3. Please include a copy of Table 2 which you refer to in your text on page 23. A copy of Table 2 has been included in both manuscripts (track and no track). Additional Editor Comments 1. 39-40: please modify as follows: “current environmental challenges associated with conducting cognitive tests of animals in the wild” This has been changed. 2. L 494-495 “We also found that troop membership determined the likelihood of eating a higher proportion of correct kernels across trials.”: couldn’t it be done to color preference? Did you test color preference before administering the differently flavored kernels? Although I appreciated the answers you provided in reply to the comments of the previous reviewers on this issue, this aspect should be accounted for in the Discussion. We have added the following text to the Discussion to account for the point raised by the reviewer: “Since we were avoiding any potential habituation or possible generalisation between colours, we did not test for a specific colour preference to exclude; however, as both colours used in this task represented food items frequently eaten, and thus, ecologically relevant to learn, it was not obvious at the time that the baboons would have a preference for either colour.” (L568-572) 3. L 529 “and the interaction that was contrary to prediction in trial 2”: please remind the reader what is the interaction you are mentioning here Done. The line now reads: “the interaction between neophilia and kernel number that was contrary to prediction in trial 2”. 4. L 533: “that” repeated twice (typo) The additional word has been removed Reviewer #1 General comments This study aimed to assess associative learning performance of individuals in two groups of wild baboons. The manuscript represents a significant amount of work and while it is unfortunate that the authors were not able to detect learning, I believe the trial was thoughtfully conducted. There remains a paucity of studies investigating cognition in wild populations (it is challenging!) and studies such as the current one are useful building blocks for future attempts. The authors have done a nice of addressing the concerns/ suggestions raised in the previous two reviews, as such I only have minor comments. Minor comments 1. L51 – As there are only a small number of studies that have looked at the fitness consequences of associative learning (adaptive value) in natural environments, it would be good to cite them all here. Also, as the authors correctly point out, differences in learning are likely to reflect adaptions. The only study currently cited reports a positive correlation between learning performance and fitness correlates, but two other studies report negative relationships: Ashton B.J., Ridley A.R., Edwards E.K., Thornton A. 2018 Cognitive performance is linked to group size and affects fitness in Australian magpies. Nature 554 (7692), 364-367 (doi:10.1038/nature25503). Madden JR, Langley EJG, Whiteside MA, Beardsworth CE, van Horik JO (2018) The quick are the dead: pheasants that are slow to reverse a learned association survive for longer in the wild. Philosophical Transactions of the Royal Society B: Biological Sciences 373 Evans LJ, Smith KE, Raine NE (2017) Fast learning in free-foraging bumble bees is negatively correlated with lifetime resource collection. Scientific Reports 7:496 We have now added two additional references of studies that directly evaluate the fitness consequences of associative learning. Here, we also make sure to point out that there are exceptions to our statement, in particular, that differences in learning likely reflect adaptations by citing the references suggested above (L51-52). 2. L52 – This sentence should be modified to improve clarity. I suggest inserting ‘while ultimately differences in…’ removing also from ‘individuals within a species also differ…’ and including ‘genotype or epigentic changes due to the developmental trajectory environment experienced during their lifetime. This has been changed to: “While ultimately differences in associative learning abilities between species are likely to reflect adaptation, inter-specific differences are also likely to reflect genotype or epigenetic changes dependent on developmental trajectory, and the environment experienced during their lifetimes” (L51-55). 3. L85 – Incorrect references inserted, should be references ‘3’ (bumble bees) and ‘4’ (humingbirds). It doesn’t seem as though reference ‘5’ has been used at all – so it would pay to double check all references align as intended. References have been checked and corrected. 4. L86 – Rather than including a second reference for bumble bees can you reference a study that has looked at associative colour learning in foraging mammals? Or at least a different taxon e.g. butterflies, jumping spiders. We have added new references related to studies with butterflies and birds (L86-88). 5. Discussion – it would be good to briefly comment on baboon cognition generally. What would you expect to see in terms of their associative learning performance, based on their performance in other cognition assays, conducted either with captive or wild individuals? We have added a few sentences that lay out our expectations in reference of baboons’ learning abilities in wild and captive conditions: “Previous studies with chacma baboons provided further support to our expectation that baboons would concisely and rapidly learn the taste-colour association. For example, captive baboons were able to solve complex learning tasks using an automated device; while wild baboons are capable of rapidly learning the location of valuable food items”. (L537-541). 6. L552 - It would be good to reinforce that studying cognition in wild populations is generally very challenging by citing the below studies: Huebner F, Fichtel C, Kappeler PM (2018) Linking cognition with fitness in a wild primate: fitness correlates of problem-solving performance and spatial learning ability. Philosophical Transactions of the Royal Society B: Biological Sciences 373 Morand-Ferron J., Cole E.F., Quinn J.L. 2016 Studying the evolutionary ecology of cognition in the wild: a review of practical and conceptual challenges. Biological Reviews 91(2), 367-389. (doi:10.1111/brv.12174). These references have been added (L591). Reviewer #2 General Comments This is an interesting study on individual differences in wild baboons’ abilities to learn a two-choice color/food discrimination. I applaud the authors for their careful efforts to conduct individual cognitive tests with wild, group-living animals. Although it is not clear what the baboons had learned, the finding that even seemingly simple behavior is strongly influenced or constrained by animals’ natural environment is important. Indeed, such factors should be more frequently considered in studies of captive animals as well. However, I have some concerns and suggestions about how to effectively present and analyze the data. INTRODUCTION This section provides a nice exposition but the motivation for the study is not obvious to the reader. Yes, we know little about this topic in the wild, but you could make clearer why people should want to study learning (and between- and within-species variability) in the first place. There is a gap, but why do we want to fill it? What we can learn from this more broadly? 1. Lines 69-72: I’d add that, on the other hand, captive animals may also “fail” on problems they’d never encounter in the wild, which can distort our estimate of that species’ cognitive abilities in the other direction. We have included the reviewer’s suggestion, but we note that captive animals normally go through long period of training to habituate them to the novelty of ecologically-irrelevant tasks. The additional text reads as follows: “Conversely, the failure of captive animals to solve tasks that use unfamiliar stimuli may equally distort our estimate of that species’ cognitive abilities”. (L72-73) 2. Lines 111-113: Couldn’t you argue the opposite, that males would need to learn faster because they encounter new environments? But in general, how stable are these baboons’ environments outside of seasonal variability? Even if they disperse and go to a neighboring group, aren’t they still living in the same general habitat with the same food sources? We agree with the reviewer and had debated this amongst ourselves. We have now deleted this justification for our prediction, and instead present only the second, stronger prediction. We did not imply that males would not have eventually learnt the association themselves, but rather that females may do so more quickly because they have a greater need to do so. 3. Line 122: “because of their greater experience” I don’t fully follow this point. Greater experience with what? With that particular food item, with learning associations, something else ..? We have now added: “identifying changes in food items with regards to their colour and taste” to better explain what we mean (L121-122). 4. Lines 134-135: How can they be independent and considered opposites? We have re-worded the sentences to clarify our meaning. The sentence now reads: “Although both responses are sometimes considered opposite ends of the same personality continuum, they are independent of one another” (L134-136). 5. Line 140-141: Why? If the idea is that neophilic individuals spend more time with the task (or would perhaps be less vigilant?) and therefore are more likely to learn, I suggest adding this explicitly – and then since you have the data, you should also test it statistically. No, as we explain in the previous sentence, “neophilia … is associated with greater innovation and successful problem-solving as individuals are more likely to explore a novel situation”. To clarify, we have now changed the wording to: “Based on these previous findings, …”(L140-141) 6. Line 142: Please add a brief definition for vigilance. We have added a definition (L142-143). METHODS The authors collected a substantive data set including food choice in the learning task and individual differences in sex, rank, neophilia, age, and vigilance in prior trials. However, several reasons make it difficult to fully appreciate this data set. 7. There are no indicators of reliability (except for a previous repeatability estimate for neophilia, which seems low to rely on a single measurement here). The repeatability of neophilia is in line with average repeatability for behaviour (Bell et al., 2009), and single measurements were made to decrease the number of times the baboons were presented with stimuli from the observers. This single within-season measurement appears to be biologically meaningful, predicting, for example, the way individuals use personal and social information during foraging (e.g. Carter et al. 2014). Regarding other behaviours that were quantified from the videos, each video was processed by one of the authors (CML) at least three times to quantify kernel choice, number of kernels and number of vigilance events. In cases when the measurement varied from one coding to the next, we re-coded the behaviour once more. We consider these measurements to be reliable, as there was little ambiguity. To address this, we now included a small paragraph in the “Experimental protocol” section, defining sources of potential ambiguity and noting the steps taken to address such irregularities. Bell, Alison M., Shala J. Hankison, and Kate L. Laskowski. "The repeatability of behaviour: a meta-analysis." Animal behaviour 77.4 (2009): 771-783. Carter, Alecia J., et al. "Personality predicts the propensity for social learning in a wild primate." PeerJ 2 (2014): e283. 8. Line 221: When trials were interrupted, did that always end the trials? If so, please state. We have added this information (L240-242). 9. In lines 312-313, you say that a trial “was considered finished” when all kernels of one pile were eaten. 1) This information needs to be much earlier (move it into the Experimental Procedure section). 2) Did the experimenter remove the other pile at this point? Or was it just considered finished for data recording, but the baboon could still eat the other pile? This information was added in the Statistical Analysis section as it relates to how we selected the data under analysis, that is, that in a given trial, all observations after the first pile was consumed entirely were not considered for the given reasons. To clarify this, we have changed the wording to: “we did not consider individuals’ choices after all the kernels of one pile were eaten” (L331-332). We were unable to remove the remaining pile of kernels while the trial was ongoing for risk of aggression from the baboons. 10. Line 243: “(2) the colour of the first ten kernels consumed” – should this really be 10? not “all”? We thank the Reviewer for pointing this out. We have changed accordingly. 11. Lines 265-267: I’d add that this means all cases where they didn’t eat one pile in its entirety (20 kernels) first. It’s implied from having 20 kernels in each pile and you say this later in the manuscript, but it would be helpful for the reader here too. We have added this information (L291). 12. Table 1: The sums by row sum to trial 1-5: 34, 37, 38, 38, 32. Why can the number of individuals for trial 1 be lower than subsequent trials? And if the grand total here is 179 and interruptions were 22, does that not contradict the descriptives in lines 400-401 and 531? Table 1: But what is the proportion when they switched within the first 20 trials? E.g., 19 vs. 1 would still essentially be bulk feeding. The cutoff could be arbitrary, of course, but the distribution seems important because you use this criterion to subset your data. Although we tested 37 animals with a first trial, we were unable to extract data from three recordings, as the camera malfunctioned in these cases, hence the difference in sample size between trial 1 and the rest of the trials. We have a total of 162 trials, out of which we were able to extract data from 159 (the total number of trials on the table). The interruptions listed on the table represent interruptions that happened before animals could sample both piles. We have added a line to the results to specify the difference between overall interruptions and interruptions which happened before both piles of kernels were samples (L421-423). The number of interruptions described in L531 corresponds to the interruptions that happened after both piles were sampled, but which nonetheless affected the patterns under analysis. In Table 1, we included all individuals who had sampled both colours before consuming one pile of kernels entirely, regardless of the proportion when they switched, as we believed this offered more consistency; while the cutoff was decided only when a single pile of kernels remained. 13. The number of observations is extremely variable, in part due to the natural distribution of traits in this population, in part due to practical constraints during data collection, and in part due to subsetting for statistical analyses. This is not a problem per se but could be presented in a more organized fashion. The eventual reader could consult the full data set, but making it available for review as well would be extremely helpful in evaluating the analytical approach. Our dataset is available in an open repository and we have provided the DOI with this resubmission to access it. 14. Line 284: You tested for multicollinearity but please add something like a correlation matrix or crosstabs to indicate which of these factors covaried. That’s interesting in its own right but also important to assess your models. We have added a covariance table in the supplementary material section as requested. 15. Appendix S3: How/why were these divided into low, medium, and high? Aren’t these both continuous variables? Yes, these variables were continuous, but for the purposes of ease-of-assessment for the reader, in this table each variable was separated evenly into three categories according to tertiles (bottom third, middle third, high third). We have now added more detail in our description of the Appendix ( L846-849). --Models-- The analytical approach is confusing. Couldn’t you combine Models T1, T2, …, and T5 into a single model and add trial number as a fixed effect and with interactions, like in Model T2-5? If they only learned in trial 1 and then only chose palatable trials, you’d see an interaction between trial and kernel number. If they relearned the association every time (at the same rate), you wouldn’t expect an interaction. I also still don’t understand why you couldn’t use the full data set here too? We had initially combined all models and as the reviewer suggests and had added trial number as a fixed effect. However, this approach was difficult to interpret and gave us very limited information about differences in animals’ learning within each trial. To understand whether and how within-trial learning occurred, we would have to assess each model using post-hoc tests in the manner we have now done. Consequently, we decided to evaluate each trial separately. Moreover, since the simulations we proposed predicted several possible patterns, a single model would have failed to capture this. Including the current proportion of palatable/unpalatable kernels as a fixed effect seems odd because it’s itself dependent on prior choices, which are the measure of interest. I understand the motivation to account for this change in proportion, but this seems statistically unsound. We have given this quite some thought. To reiterate, our rationale for using this variable was to account for the palatable kernels still available as a trial progressed, since individuals would be more likely to choose randomly an unpalatable kernel as the number of the palatable kernels decreased and vice versa. Not controlling for this could lead to spurious conclusions e.g. that the baboons showed learning as the trial progressed, but that this was just a function of randomly choosing many unpalatable kernels to begin with, before having few unpalatable options to choose and thus making “correct” choices as the trial progressed. We understand the reviewer’s concern, but we believe that it is important to control statistically for the change in the underlying probability of randomly choosing a correct kernel. --Simulations-- I like this approach a lot (running simulations of the proposed processes to see what the data should look like), but this needs some work. 1. Why not also do this for between-trial learning? The predictions for between-trial learning were clearer than those for within-trial learning, in large part because (a) this is the more standard approach in this field and (b) bulk feeding was not an issue for between-trial learning (as we did not interrupt trials after the first pile had been eaten). 2. Please provide more detail about how the simulations work. E.g., how many runs per scenario? What were the probabilities? Scenario 1: What kernel number is meant by “then” (line 362)? Scenario 2: What precisely is meant by “intermittently” (line 365) and how was it determined whether 1, 2, or 3 kernels were sampled? The simulated scenarios were developed post hoc based on our observations of the feeding patterns seen throughout trials, in particular, the bulk feeding. The simulations were used to generate patterns with which to compare our observed data. As such, these are not iterative statistical simulations. We have made some additions in the text to clarify this, and now refer to “scenarios” c.f. “simulations”. In particular, we have added the following text: “Scenarios were not probabilistic and as such there is only one outcome for each scenario, as described.” (L390-391). 3. Does “baseline probability” mean the ratio of remaining palatable/(palatable + unpalatable)? If so, “baseline” may not be the best word because it suggests the initial probability (i.e., 50%) and could therefore cause confusion. Yes, the baseline probability referred to the probability of choosing a correct kernel given the numbers of kernels that were available for each choice. To avoid any confusion, we have removed the term from the manuscript. 4. Importantly, if you simulate these processes for 38 baboons, and run the same analyses that you run on the actual data (minus individual variability), can you extract these patterns? It’s not clear whether that is how you derived Table 2 (if so, state so explicitly and add the actual parameter estimates). As mentioned above, we did not conduct statistical, iterative simulations, but rather developed each scenario based on our observations of the feeding patters of the baboons. We hope the new edits make this clearer. Given the multicollinearity concerns and criticisms of stepwise regressions, I agree with the previous Reviewer that a model selection/information theoretic approach might be more appropriate (Burnham & Anderson, 1998). It also lets you avoid overfitting, but in an arguably more principled manner (specifically, by including a penalty term for each added parameter). Regarding the concerns the reviewer highlights, we controlled for multicollinearity through VIFs as suggested by Zuur et al. [1] and have provided a correlation matrix showing Spearman rank correlations between all fixed effects variables. Regarding the criticisms of stepwise model selection, disciplined hypothesis testing for small amounts of model reduction, which we do here, is considered as appropriate [2]. Given that we controlled for both issues the reviewer mentions, and that post-analysis changes to modelling approaches are also criticised, we are reticent to change our analysis at this stage. RESULTS 1. Some descriptives (medians, ranges) for neophilia, vigilance, time spent on the task would be nice. The median and range for neophilic behaviour has been added in L175. Median and range for vigilance and time spent on the task were added on L435-436 and L424 respectively. 2. Lines 405-406: How many kernels did they leave uneaten in subsequent trials? Since we were comparing the pilot study to the current study, we focused only on the first trial, as any kernels left uneaten in subsequent trials would have been left on account of potential learning from previous trials and would not have been a fair comparison to the pilot study. Table 3: This might change depending on changes to your analytical approach, but it’s hard to extract what probabilities/odds r atios for kernel choice these models actually predict. This would be much aided by a plot with the model fit overlaid (see also below). We have included in Table 3 a conversion of the estimate (in logit scale) to probability. We hope this offers further information and clarity on our models. We have also modified our figures to incorporate the reviewer’s suggestions (see our answer to comment 1 FIGURES below). FIGURES 1. The current figures do not convey a lot of information and only in aggregated form. And the arguably most useful figure (S4) to get a quick sense of the baboons’ behavior in the learning task is hidden in the supplemental material. The figures could be used to much greater effect to include e.g. a line + uncertainty band to indicate model fit, the number of observations (e.g., as labels or proportional to point size) or individual points/trajectories (e.g., in semi-transparent, small points or thin lines) having Fig. S4 in the same format as Fig. 2 (simulations) would also really help comparison to the different learning processes. We have added the figure previously depicted on S4 to the main text (now Fig 3) as the reviewer suggested and have added the number of observations as labels. We have additionally modified the other two figures (Fig 4 & 5) to include individual points. 2. Fig. 2 should have the same x- and y-axis scales throughout, to aid com0parison (i.e., kernel size always from 1-5 to 36-40, baseline probability always from 0 to 1, proportion palatable always from 0 to 1). We had originally plotted the graphs as the reviewer has suggested. However, these plots are merely visual aids of what the patterns of consumption of kernels would look like given our within-trial learning scenarios. Altering the x-axis to include all kernels (i.e. 1-5 to 36-40), affected the visualization of the predicted effect of each scenario, particularly in scenarios 3 and 5, where changing the scales of both axes severely impacted the visualization and interpretation of the plots. We ultimately decided to change this to enable a clearer visualization of the possible trends. 3. Fig. 4 should have ranges as the labels (i.e., 1-10 instead of “10” etc.), also make the y go from 0 to 1 This has been done. 4. Fig. 4 how were low and medium determined? Also, isn’t this a continuous variable? If the categorization was just for visualization purposes, please state so in the caption. This information has now been added to the legend of the figure. DISCUSSION Assuming that the general finding holds (no evidence of learning as measured by [exclusive] preference of palatable kernels, let alone individual differences), this seems fine. I appreciate the discussion of how things can go awry in the field. Two minor points to perhaps discuss in more depth: 1. The novel food wasn’t really novel, as the baboons had all eaten corn before. Just the color and bitter taste were new. To what extent do you think their previous experience/learning explains why they didn’t learn (more) here? If they already knew that it was nutritious and not poisonous, they may well have learned that one tasted bitter, but that might have simply not been enough to outweigh a free meal (especially when food is scarce, as you mention). In the “Experimental Procedure”, we clarify that: “we considered it unlikely that these individuals’ responses to the task would be any different, as all the baboons had prior experience with corn kernels [3] and none had previously participated in a task involving colour cues indicating variation in palatability” (L204-207). We also detail in the Discussion section the possibility that animals learn that one option was “safe”, albeit bitter tasting, and that ultimately the kernels were too nutritious to avoid entirely (L550-552). To this end, we have now included a few of lines to reflect your suggestion (L557-559). 2. Is there any other evidence in the literature that individual differences play less of a role during times of food scarcity because everybody is similarly limited in what they can do? If so, that may be worth including. While we were unable to find a study that directly tested the role of individual differences in response to the availability of food resources, we wanted to reflect the reviewer’s comment in our Discussion. We have now added the text: “Despite substantial literature detailing the influence of unfavourable environment on animals’ cognitive abilities and cue use, there is little empirical evidence on the phenotypic plasticity of labile traits related to cognition. Consequently, we have a poor understanding of cognitive variability in fluctuating, rather than predictable and constant, environments” (L597-601) Minor Comments 1. The use of “incorrect/correct” kernels sounds a bit awkward and seems unnecessary. I suggest simply using “unpalatable/palatable” throughout the manuscript. We have made these changes throughout. 2. Please carefully check your references. Some refs listed in the list are not listed in the text (e.g., [5, 36, 63, 64]) and refs [58] and [59] are identical. References have been checked. 3. Line 35: should be “individuals’ phenotypes” (with an s) Done. 4. Line 47: should be “aspects such as foraging behavior” (add “as”) Done. 5. Lines 90, 93, 95: I’d recommend italics instead of all-caps to emphasize the either/or’s Done. 6. Line 110: better “female baboons would” (instead of “will”) Done. 7. Line 203: should be “prior experience with corn kernels” (instead of “of”) Done. 8. Line 388: should be Table 2 (instead of 1) Done. References 1. Zuur AF, Ieno EN, Walker N, Saveliev AA, Smith GM. Mixed Effects Models And Extensions In Ecology With R [Internet]. Springer, editor. New York, NY: Springer New York; 2009. (Statistics for Biology and Health). 2. Bolker BM, Brooks ME, Clark CJ, Geange SW, Poulsen JR, Stevens MHH, et al. Generalized Linear Mixed Models: A Practical Guide For Ecology And Evolution. Trends Ecol Evol. 2009;24(3):127–35. 3. Marshall HH, Carter AJ, Ashford A, Rowcliffe JM, Cowlishaw G. Social Effects On Foraging Behavior And Success Depend On Local Environmental Conditions. Ecol Evol. 2015;5(2):475–92. Submitted filename: Response to reviewers_20200123.docx Click here for additional data file. 4 Mar 2020 PONE-D-19-25653R1 Exploring individual variation in associative learning abilities through an operant conditioning task in wild baboons PLOS ONE Dear Dr. Carter, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. I require to address the following very minor points before the manuscript can be accepted: - L 40: two full stops (typo) - L72-73: the sentence is a bit unclear, I suggest to modify it as follows: “Conversely, the failure of captive animals to solve tasks that they would never encounter in the wild may equally distort our estimate of that species’ cognitive abilities”. - L 83: two square brackets (typo) - L 135: I suggest to delete the word “personality” here, since personality is a complex construct; similarly, on l 480 I suggest to replace “personality” with “neophilia” - L 323: the parenthesis at the end of the sentence is missing - L 422, 540 please replace the semicolon with a comma - L 544 (and 561) since “tendency” has a very precise meaning in statistics (0.05 < p < 0.1), unless this is the case, I suggest to replace this word with an alternative one - L 551 please erase the comma after “study” We would appreciate receiving your revised manuscript by Apr 18 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols Please include the following items when submitting your revised manuscript: A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. We look forward to receiving your revised manuscript. Kind regards, Elsa Addessi Academic Editor PLOS ONE [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: All comments have been addressed Reviewer #2: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: All minor corrections requested have been suitably addressed by authors. I look forward to seeing this work published. Reviewer #2: This paper details a study on individual differences in wild baboons’ abilities to learn a two-choice color/food discrimination. It nicely highlights that even well-designed and seemingly simple tests of animal behavior and cognition can be heavily affected by environmental conditions. I thank the authors for their clarifications in the manuscript and thoughtful responses to the comments/suggestions raised in the review process. ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step. 6 Mar 2020 1. L 40: two full stops (typo) This has been corrected. 2. L72-73: the sentence is a bit unclear, I suggest to modify it as follows: “Conversely, the failure of captive animals to solve tasks that they would never encounter in the wild may equally distort our estimate of that species’ cognitive abilities”. This sentence has been modified as suggested. 3. L 83: two square brackets (typo) The additional bracket has been removed. 4. L 135: I suggest to delete the word “personality” here, since personality is a complex construct; similarly, on l 480 I suggest to replace “personality” with “neophilia” All mentions of “personality” have been replaced by the work “neophilia”. 5. L 323: the parenthesis at the end of the sentence is missing This has been added. 6. L 422, 540 please replace the semicolon with a comma Done. 7. L 544 (and 561) since “tendency” has a very precise meaning in statistics (0.05 < p < 0.1), unless this is the case, I suggest to replace this word with an alternative one. We have modified each sentence to replace the word “tendency”. In L 544-545 the sentence now reads: “This interpretation is supported by our observation that the baboons left slightly more bitter kernels uneaten”. In L562-563 the sentence now reads: “Such a result may be indicative of a species’ selective attention towards red food items”. 8. L 551 please erase the comma after “study” Done. Submitted filename: Response to reviewers (final).docx Click here for additional data file. 10 Mar 2020 Exploring individual variation in associative learning abilities through an operant conditioning task in wild baboons PONE-D-19-25653R2 Dear Dr. Carter, We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements. Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication. Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. With kind regards, Elsa Addessi Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: 13 Mar 2020 PONE-D-19-25653R2 Exploring individual variation in associative learning abilities through an operant conditioning task in wild baboons Dear Dr. Carter: I am pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. For any other questions or concerns, please email plosone@plos.org. Thank you for submitting your work to PLOS ONE. With kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Elsa Addessi Academic Editor PLOS ONE

38 in total