Literature DB >> 30202509

Chasing Animals With Split Attention: Are Animals Prioritized in Visual Tracking?

Thomas Hagen¹, Thomas Espeseth², Bruno Laeng¹.

Abstract

Some evolutionary psychologists have hypothesized that animals have priority in human attention. That is, they should be detected and selected more efficiently than other types of objects, especially man-made ones. Such a priority mechanism should automatically deploy more attentional resources and dynamic monitoring toward animal stimuli than nonanimals. Consequently, we postulated that variations of the multiple object or identity tracking and multiple event monitoring tasks should be particularly suitable paradigms for addressing the animate monitoring hypothesis, given their dynamic properties and dependency on divided attention. We used images of animals and artifacts and found neither a substantial sign of improvement in tracking the positions associated with animal stimuli nor a significant distracting effect of animals. We also failed to observe a significant prioritization in orders of response for positions associated with animals. While we observed an advantage for animals in event monitoring, this appeared to be dependent on properties of the task, as confirmed in further experiments. Moreover, we observed a small but inconclusive advantage for animals in identity accuracy. Thus, under certain conditions, some bias toward animals could be observed, but the evidence was weak and inconclusive. To conclude, effect sizes were generally small and not conclusively in favor of the expected attentional bias for animals. We found moderate to strong evidence that images of animals do not improve positional tracking, do not act as more effective distractors, are not selected prior to artifacts in the response phase, and are not easier to monitor for changes in size.

Entities: Chemical Disease Species

Keywords: animacy; animals; animate monitoring hypothesis; attention; multiple event monitoring; multiple identity tracking; multiple object tracking

Year: 2018 PMID： 30202509 PMCID： PMC6124190 DOI： 10.1177/2041669518795932

Source DB: PubMed Journal: Iperception ISSN： 2041-6695

Introduction

According to the animate monitoring hypothesis (New, Cosmides, & Tooby, 2007), modern humans have inherited a perceptual mechanism that automatically monitors animate objects (i.e., humans and animals) or, in other words, items with high biological relevance (e.g., prey or predators). Such a mechanism should have provided our ancestors with the ability to quickly notice and keep track of nearby animals such that one could (adaptively) avoid becoming prey or miss an opportunity to hunt. While the above hypothesis appears plausible from an evolutionary perspective, it is relevant to point out that a fundamentally similar distinction in the human semantic system between animate and inanimate (e.g., man-made artifacts like tools) has also been documented as patterns of selective impairments in neuroanatomically damaged patients (e.g., Capitani, Laiacona, Mahon, & Caramazza, 2003; Caramazza & Shelton, 1998; Gainotti, 2000, 2010; Hillis & Caramazza, 1991; Mahon & Caramazza, 2009; Warrington & Shallice, 1984). That is, in several neuropsychological studies, it has been shown that some patients can show a striking deficit for identifying animals while having a nearly intact ability to identify artifacts, whereas other patients show the reverse dissociation. Such findings are, however, not limited to patients, as a normal category-specific tendency in object identification has been observed in healthy individuals as well (Capitani, Laiacona, Barbarotto, & Trivelli, 1994; Låg, 2005; Låg Hveem, Ruud, & Laeng, 2006; Laws & Hunter, 2006; Laws & Neve, 1999). Furthermore, neurophysiological studies have shown a distinction between animate and inanimate objects in both humans and monkeys (e.g., inferotemporal response clustering of animate and inanimate objects; Kriegeskorte et al., 2008) and functional imaging have indicated similar distinctions (Konkle & Caramazza, 2013; Sha et al., 2015; Wiggett, Pritchard, & Downing, 2009), even in blind subjects without any prior visual experience (Mahon, Anzellotti, Schwarzbach, Zampini, & Caramazza, 2009). While it remains unclear whether the neuropsychological observations described earlier stem from an innate or acquired distinction (e.g., Gainotti, 2015), they do suggest a strong relevance of visual and semantic classifications of animate and inanimate objects within, at least, the cognitive system of the primate brain. However, the animate monitoring hypothesis specifically proposes the presence of a low-level innate and adaptive mechanism for the classification of animate and inanimate objects. More specifically, according to New et al. (2007), animals should spontaneously and preferentially recruit more visual attention than artifacts regardless of their relevance to the task. Another critical aspect of the hypothesis is that, as animals in a natural setting can rapidly change their trajectory or position in a fraction of a second, the system should not only be geared toward detecting animals but also to actively monitor them in an ongoing manner through frequent inspections of their status (New et al., 2007). Accordingly, we expected that animals will bias the spatial distribution of attention and cause stronger spontaneous recruitment of attention. One area of category-specific attentional biases that have received some consideration and which, intuitively, is similar in nature to an attentional bias toward animals is the case of visual tracking of human faces (e.g., Li, Oksama, Nummenmaa, & Hyönä, 2017). Accordingly, one could anticipate that paradigms sensitive to human faces would also be sensitive to the presence of animals. The seminal study by New et al. (2007) showed that animals were more readily detected in a change detection task (cf. Simons & Levin, 1997) where photographs containing animals and artifacts (man-made objects) would rapidly change, after a brief blank screen, so that an object would be briefly present or absent in quick succession while having its masked motion signal (see also Altman, Khislavsky, Coverdale, & Gilger, 2016). Crucially, the active monitoring aspect of the hypothesis is presumed to protect the animal stimuli from the effects of occlusion or interruptions of the visual scene. A wealth of studies have shown that the change in this task is not noticed immediately but takes time to be identified, as it is strongly dependent on the location of the focus of attention. Thus, the paradigm can be used fruitfully to evaluate the ability of specific items (or locations) to capture attention based on the lag between onset of the flickering image and a correct identification. Although New et al. (2007) gathered convincing evidence for an animacy advantage, such a finding has been questioned recently on methodological grounds (Hagen & Laeng, 2016; see also Hagen & Laeng, 2017), that is, it remains possible that the effect could stem from uncontrolled factors relating to aspects of the photographic scenes rather than of the object categories per se. This does of course not rule out that an attentional bias might actually exist as hypothesized and be measurable with other or more controlled tasks. In fact, recent research has shown that animate objects tend to be more easily localized in a visual search task (Jackson & Calvillo, 2013) as well as being more noticeable in an inattentional blindness task (Calvillo & Hawkins, 2016; Calvillo & Jackson, 2014). Hence, it seems important to explore in more depth the mechanisms involved and how they could relate to other aspects of attention. In particular, we reasoned that animacy, given that it is grounded in motion and dynamic action (Pratt, Radulescu, Guo, & Abrams, 2010), it should be especially relevant for dynamic aspects of attentional performance, that is, when visually tracking dynamic (moving) objects. Indeed, in daily life, objects frequently change positions within our visual field, be it they change physical positions or because we move our body and eyes. Consequently, to monitor objects in our environment, we are taxed with the challenge of continuously and dynamically updating their positions. A task or experimental paradigm frequently used to study this ability is the multiple object tracking (MOT) task (e.g., Alnæs et al., 2014; Cavanagh & Alvarez, 2005), where participants are asked to track a subset of identical moving objects on a screen. While this paradigm was originally developed to study the limits of spatial and divided attention, it has also been used to study the binding of features and identities to the tracked objects (what and where dimensions; Cohen, Pinto, Howe, & Horowitz, 2011; Horowitz et al., 2007; Oksama & Hyönä, 2008; Pylyshyn, 2004) by assigning identities to objects or displaying them as unique objects in a multiple identity tracking (MIT) task. In particular, recent studies have indicated that certain categories or properties of the tracked objects can influence participants’ ability to keep track of objects’ identity and position. These studies have shown biases for stimuli such as angry faces (Li et al., 2017), attractive faces (Li, Oksama, & Hyönä, 2016; Liu & Chen, 2012), fearful faces (Jin & Xu, 2015), and some familiar objects (Oksama & Hyönä, 2008; Pinto, Howe, Cohen, & Horowitz, 2010). Moreover, results from a neuroimaging study that contrasted tracking of unique objects with tracking of a uniform set of objects have shown that temporal brain regions, typically associated with object recognition, are more engaged in the tracking of unique objects than in the tracking of a uniform set of objects (Nummenmaa, Oksama, Glerean, & Hyönä, 2017). Thus, performance on this task is sensitive to object content and appears to influence tracking ability beyond low-level features, suggesting that higher order processing can influence positional tracking. Given that the task is dependent on continuously distributed and divided attention and can be influenced by how the objects are individuated, it should be sensitive to a mechanism favored by natural selection for its ability to automatically and adaptively deploy sustained monitoring of the locations of animate objects (cf. New et al., 2007). Another recent development in this tracking paradigm is the multiple event monitoring (MEM) task (Wu & Wolfe, 2016) where participants are instructed to keep track of multiple moving images of objects while also monitoring them for changes. This task has to date not been employed to investigate attentional biases, but its requirement to actively monitor the status of objects for changes should be highly relevant for investigating the animate monitoring hypothesis. As a bias toward automatically monitoring animals for changes in position or state should have had significant survival value for human ancestors, New et al. (2007) specified that the system did not just evolve to detect animals but also to autonomously monitor animals in an ongoing manner. In essence, the system should be sensitive to moving objects that looks like animals if these aspects played a role in the natural selection of the system. We believe that being able to document an attentional bias toward animals with dynamic tracking tasks should be greatly beneficial to a further understanding of the extent or limits of animacy’s ability to influence dynamic, distributed, and sustained visual attention. Hence, the goal of this study is to attempt to document the presence and extent of such a bias. Specifically, estimates of effect sizes, a sufficient level of power, and Bayesian approaches appear to be necessary, as potentially nonsignificant results cannot be used as conclusive evidence for a particular effect being absent (Dienes, 2014). Furthermore, this approach should help narrowing down the set of situations where animate monitoring can have a sizable influence on the perceptual and attentional system. More specifically, in line with the animate monitoring hypothesis, we expected to find that attention prioritizes animals in an automatic manner. Associating a task-relevant object with animacy (i.e., an image of an animal) should promote strong attentional allocation and vigilance toward that object and this effect should be measurable as prioritized responses as well as improved tracking and monitoring ability compared with objects that are not associated with animacy. Likewise, task-irrelevant animal distractors should be particularly capable to divert attention away from task relevant objects, which should be measurable as an increase in animal distractors being incorrectly reported as targets. In other words, participants should report animals more frequently than artifacts, irrespective of their status as targets or distractors.

Experiment 1

One model of how tracking takes place in MOT proposes that the observer allocates one focus of attention per target (Cavanagh & Alvarez, 2005) and, consequentially, a limited pool of neural resources gets divided among them (Alnæs et al. 2014; Kahneman, 1973). The aim of the present, initial, experiment was to attempt to influence this assignment process by presenting one of the targets as an animal. According to the animate monitoring hypothesis, animals should automatically capture attention more strongly than other objects. Hence, our hypothesis was that targets displayed as animals during the target assignment phase in an MOT task would bias the amount of resources assigned to it and thereby result in improved tracking accuracy. In addition, we predicted that this process of prioritization should also lead to a bias in the order in which targets are reported, and that animal distractors should be reported as targets more frequently than artifact distractors. More specifically, in the first experiment, two randomly positioned images of objects were used as targets while another set of 10 objects were used as distractors. After viewing the targets for a period of time sufficient to identify each of them uniquely, we occluded the objects with black disks during the tracking period such that the images were only visible during target assignment. We made the straightforward prediction that animal targets would be tracked more successfully than artifact targets. Second, as participants were free to report the targets in any order they liked, we predicted that animal targets would be reported (clicked on) before targets presented as artifacts due to the presumed prioritization process. Third, as targets’ localization errors would be dependent on target–distractor confusions, we predicted that when participants made erroneous responses, that is, a distractor was reported as a target, it would be more likely for such a distractor to be an animal rather than an artifact. This was expected from the supposedly automatic prioritization of animals in attention, which should make it more likely for targets’ swaps (Drew, Horowitz, & Vogel, 2013) to be biased toward animal distractors (i.e., targets should be more confusable with animal distractors than artifact distractors). To probe this, we computed the percentage of incorrect responses per participant, as the percentage of animal and artifact distractors (across all trials) that were selected during the response phase. Power analysis based on an estimated mean of .3 from previous studies (Jin & Xu, 2015; Li et al., 2016, 2017; Liu & Chen, 2012) indicated that at least 22 participants were required to obtain 80% power in detecting a 4% advantage in accuracy for animals (d = 0.65, d = 0.40). However, to encompass an even smaller advantage of 2.5%, we aimed to test at least a twofold sample of about 50 participants ( = .15, d = 0.40, d = 0.25). Some of the experiments in the original study by New et al. (2007) were designed to have 80% power to detect an effect size of d = 0.47. Although no specific reasoning for this level of sensitivity has been provided, this may serve here as an indication of the theorists’ intended minimum effect size of interest. At any rate, our experiment should be able to detect an effect size smaller than the minimum implied by the existing reports in favor of the animate monitoring hypothesis. In fact, we should be able to detect an effect size of a magnitude much smaller than what has been typically detected in studies on attentional biases (Jin & Xu, 2015; Li et al., 2016, 2017; Liu & Chen, 2012; New et al., 2007). To be able to quantify the evidence for and against a given hypothesis, we used JASP (https://jasp-stats.org/) to calculate Bayes factors (BFs; with JASP’s default prior). We report BF01 in favor of the null hypothesis, expressing the probability of the data given the null hypothesis relative to the alternate hypothesis (e.g., a value of 7 would suggest that the observed data are seven times more likely to have occurred under the null hypothesis than under the alternate hypothesis). Specifically, as the value of BF01 increases above 1, there is more evidence in support of the null hypothesis (e.g., that an effect is likely to be absent). Conversely, as the value decreases below 1, there is more evidence in support of the alternate hypothesis (e.g., that an effect is likely to be present). Inverting BF01 (1/BF01) yields BF10, which expresses how likely the data are under the alternate hypothesis relative to the null hypothesis (Dienes, 2014; Jarosz & Wiley, 2014). The BFs can be further interpreted or categorized based on the obtained value, for example, a BF01 in the range 1 to 3 (or BF10 within 1–0.33) can be viewed as anecdotal (i.e., weak, inconclusive) evidence (see Andraszewicz et al., 2015; Wetzels, Ravenzwaaij, & Wagenmakers, 2015, for the interpretations adopted here).

Methods

Participants

We recruited 68 (20 women) participants with a mean age of 33 years (range: 18–57 years, standard deviation [SD] = 8.81 years). All were recruited with Crowdflower®. All participants agreed to an informed consent approved by the institute’s internal review board (Reference No. 1439337) and in accordance with the Declaration of Helsinki.

Apparatus

The experiment was implemented with JavaScript and each participant ran the experiment on their own computer, as is typically the case with crowdsourcing experiments (Crump, McDonnell, & Gureckis, 2013).

Stimuli

We used the Snodgrass and Vanderwart’s (1980) stimulus set of line drawings to select 20 animal and 20 artifact images that were balanced on complexity ratings, animals: M = 3.76, SD = 0.48, artifacts: M = 3.62, SD = 0.46, t(37.98) = 0.91, p = .366, familiarity ratings, animals: M = 2.66, SD = 0.84, artifacts: M = 2.87, SD = 0.71, t(36.98) = 0.83, p = .414, and number of pixels, animals: M = 1,684, SD = 460, artifacts: M = 1,804, SD = 550, t(36.85) = 0.75, p = .456. The images of animals were the following: alligator, ant, bird, cat, chicken, cow, deer, dog, donkey, elephant, fox, gorilla, horse, mouse, peacock, pig, rabbit, rooster, snake, and squirrel. The images of artifacts were the following: airplane, baby carriage, barn, barrel, bicycle, cannon, church, French horn, gun, helicopter, iron, motorcycle, rocking chair, roller skate, sailboat, trumpet, violin, wagon, watch, and whistle. These images were then used to generate MOT trials with one animal and one artifact as targets and five animals and five artifacts as distractors. A pool of 240 unique paths were used to select paths for each participant in the following way: First, we selected 30 random paths and assigned images to them before we copied the paths and images while switching the category of the targets to create 30 new paths. Thus, balancing object movements across target categories so that any difference between categories is not attributable to a set of paths being easier for one category than for another, as they were identical. The repetition of paths should not be of significant concern with regard to improving performance by chance for one specific category, as trial orders were randomized across participants and a large number of repetitions would be typically required to improve performance considerably (Ogawa, Watanabe, & Yagi, 2009). This setup resulted in 60 experimental trials per participant.

Procedure

The task started with the presentation of 12 randomly positioned and nonoverlapping objects for 200 milliseconds before two of them were designated as targets by enclosing them in red circles for 1,500 milliseconds (see Figure 1(a)). Next, the red circles flashed for 1,500 milliseconds before the objects were occluded by black disks and started moving around the screen. The tracking period lasted for a random duration between 5 and 7 seconds, but durations were the same between identical paths. The display was redrawn at a rate of 30 frames per second, and the objects moved with a speed of 16 pixels per frame in a display measuring 1,200 × 800 pixels. The displays were scaled dynamically to encompass differences in screen resolutions by resizing the display area to fit within the browser window of devices not supporting the full resolution. Participants were instructed to click on the target objects as soon as their movement stopped. Feedback was given by indicating the number of correctly identified objects. Each participant was required to complete five practice trials with at least 75% correct prior to starting the experiment. The practice trials contained a different set of images than those used in the main experiment (Op de Beeck & Wagemans, 2001). Task instructions were presented with textual stepwise descriptions and illustrations as well as a video demonstration of the task.

Figure 1.

Illustration of a trial in Experiment 1. First, the targets were indicated by enclosing them in red circles (a), then all objects were hidden behind black disks (b) before they started moving around the screen (c). Participants indicated the positions of the targets when their movement stopped by clicking on them (d), which also made the objects visible, to provide feedback.

Results

Before conducting the statistical analysis, we removed data from five participants for having a mean accuracy that was below 50% (1.5 SD below the median). A t test on tracking accuracy between the two categories showed no significant difference in accuracy between animal and artifact targets (see Figure 2), t(62) = 0.9, p = .37, 95% confidence interval (CI) [−0.91, 2.4], d = 0.11, d = 0.058, common language (CL) effect size = 55% (see Lakens, 2013, for reported effect size estimates). Animal targets had a mean accuracy of 77.9% (SD = 12.1%), while artifact targets had a mean accuracy of 77.2% (SD = 13.2%). A Bayesian paired samples t test showed moderate evidence (Wetzels et al., 2015) for the null hypothesis, BF01 = 4.93, suggesting that our data are 4.93 more likely to be observed under the null hypothesis than the alternate hypothesis.

Figure 2.

Combined bar and scatter plots on mean accuracy, response orders (lower numbers indicate earlier responses), and percentage of incorrect responses (distractors reported as targets) over target category in Experiment 1 (i.e., percentages of animal and artifact distractors that were selected during the response phase). Error bars show standard errors, and the superimposed scatterplots show mean values of each participant. To investigate whether participants reported animal targets before artifact targets, we conducted a t test on mean response orders, which showed no significant difference, t(62) = 0.26, p = .8, 95% CI [ − 0.024, 0.032], d = 0.033, d = 0.041, CL = 51% (see Figure 2). Animal targets had a mean value of 1.46 (SD = 0.096), while artifact targets had a mean value of 1.45 (SD = 0.084). Response orders ranged from 1 to 2, where 1 would represent the first response a subject made on a trial, while 2 would represent the last response. A Bayesian t test showed moderate evidence for the null hypothesis, BF01 = 7.012. To investigate whether animal distractors were reported as targets more frequently than artifact distractors, we conducted a t test on percentage of incorrect responses (distractors reported as targets) between animals (M = 49.5%, SD = 11%) and artifacts (M = 50.5%, SD = 11%). Two participants were removed from this analysis for having only one incorrect response. The result showed no significant difference, t(60) = 0.37, p = .71, 95% CI [−6.8, 4.7], d = 0.047, d = 0.095, CL = 52% (see Figure 2). A Bayesian t test showed moderate evidence in favor of the null hypothesis, BF01 = 6.68.

Discussion

The present results did not reveal statistically significant support for the hypothesis that presenting objects as animals and artifacts during the target assignment phase in an MOT task, should lead to (a) improved tracking accuracy for animal targets, (b) earlier responses for animal targets, or (c) more animal distractors being reported as targets. Similarly, the BFs consistently showed moderate support for the null hypothesis of no effect of images of animals across measures. Given these results, it seems unlikely that our measures are substantially different between the two types of images used. As with any experimental report, the research community should decide whether the potential for even smaller effect sizes than what our study was powered for is deemed interesting and worthwhile pursuing in larger samples. One reason for the present findings could be that simply presenting targets as animals during assignment is not sufficient to evoke a measurable bias. That is, the hypothesis rests on the assumption that prioritization is assigned to a token location of an animal and that this can be maintained as long as an attentional locus is assigned to the object, irrespective of the fact that the object no longer depicts the figure that could lead to such prioritization. Although several studies with MOT provide evidence that the visual system can track such items independent of their original identity (e.g., color), it is possible that the supposed attentional bias assigned to targets cannot be easily maintained as the objects turn to black disks and change positions during the several seconds of the tracking phase. Essentially, the presented images during the target assignment phase might have been mostly irrelevant to participants and, perhaps consequentially, became irrelevant for tracking performance as well. Hence, in the next experiment, we maintained the visibility of the images. To prevent ceiling performance in this task, we had set a speed deemed to be sufficient for yielding errors. One possibility is that such a relatively high speed was not appropriate to uncover an advantage for animals. Thus, in the next experiment, we increased the number of targets while lowering the speed in an attempt to cause more competition between multiple attentional foci and thus induce stronger priority.

Experiment 2

The first experiment failed to indicate any attentional biases toward animals. One possibility is that the previous experiment did not pose enough competition between attentional foci to bring about an advantage for animals. According to New et al. (2007), high levels of focused attention should reduce the impact of task-irrelevant nonanimals more than task-irrelevant animals. Thus, using four targets and lowering the speed of the objects (so as to not make the task too difficult), we aimed to make it more relevant for the attentional system to perform prioritizations. Moreover, most previous studies using MOT or MIT to investigate attentional biases have used three to six targets (Jin & Xu, 2015; Li et al., 2016, 2017; Liu & Chen, 2012). In addition, most studies demonstrating improved tracking performance for particular categories of objects (Li et al., 2017; Liu & Chen, 2012) have kept the objects visible during the tracking phase. Such an experimental design may also be more suited to test the animate monitoring hypothesis, as it specifically proposes that animals should be monitored continuously, in an ongoing manner. This may presuppose that their shape is visible while they are tracked, at least for most of the time. Besides, in ecological conditions, we rarely track a subset of identical objects that previously had a visible identity and then lost it. Hence, we generated a version of the task that would seem closer to natural scenarios. For completeness, a version with the objects hidden during tracking was also conducted at an early stage of the study and is included as Supplementary Experiment 1. As for the previous experiments, the main prediction was that animals would be tracked more successfully as targets than artifacts. Second, we predicted that animal targets would be reported before artifact targets. Third, we predicted that when participants made erroneous responses, it would be more likely for an animal distractor to be reported as a target than for an artifact distractor. Conceivably, the present design should be more sensitive to this aspect, as the continuously visible animals could grab attention at any time during tracking. We recruited 67 participants (13 women) with a mean age of 32.3 years (range: 19–59 years, SD = 9.52 years). We used the same images as in Experiment 1. This was similar to Experiment 1, except for the following: We used four targets (two animals and two artifacts), lowered the speed to 12 pixels per frame to avoid making the task too difficult, and objects were visible during the tracking period but hidden on the last frame before the movements stopped (see Figure 3(c)). Combined with a variable trial duration, the change in procedure aimed to avoid a strategy of simply remembering how the targets looked in order to report them correctly (i.e., they were required to continuously keep track of them).

Figure 3.

Illustrationof a trial in Experiment 2. Objects were visible during assignment (a) and tracking (b) but hidden at the last frame (c) before the response phase (d).

Illustrationof a trial in Experiment 2. Objects were visible during assignment (a) and tracking (b) but hidden at the last frame (c) before the response phase (d). Before performing statistical analysis, we removed three participants for having a mean accuracy below 50% (1.5 SD below the median). A t test on accuracy between animal (M = 70.7%, SD = 11.2%) and artifact (M = 71.1%, SD = 11.2%) targets showed no significant difference, t(63) = 0.4, p = .69, 95% CI [−2.5, 1.7], d = 0.049, d = 0.037, CL = 52% (see Figure 4). A Bayesian t test showed moderate evidence for the null hypothesis, BF01 = 6.77.

Figure 4.

Combined bar and scatter plots for Experiment 2 on mean accuracy, response orders, and percentage of incorrect responses by category. Error bars show standard errors, and the superimposed scatterplots show mean values from each participant. In contrast to the previous experiment, response orders ranged from 1 to 4, where 1 would represent the first response a subject made on a trial, while 4 would represent the last response. A t test on response order between animal (M = 2.29, SD = 0.18) and artifact (M = 2.26, SD = 0.16) targets also showed no significant difference, t(63) = 1.4, p = .15, 95% CI [−0.012, 0.073], d = 0.18, d = 0.18, CL = 57%. A Bayesian t test showed anecdotal evidence for the null hypothesis, BF01 = 2.702. Then, to investigate whether animal distractors were reported as targets more frequently than artifact distractors, we applied a t test on percentage of incorrect responses (distractors reported as targets) between animals (M = 50.1%, SD = 6.6%) and artifacts (M = 49.9%, SD = 6.6%). Again, this analysis also showed no significant difference, t(63) = 0.11, p = .91, 95% CI [−3.1, 3.5], d = 0.014, d = 0.027, CL = 51% (see Figure 4). A Bayesian t test showed moderate evidence for the null hypothesis, BF01 = 7.256. Despite increasing the number of targets and continuously displaying the target objects as animals and artifacts during tracking, we did not observe significantly more accurate tracking of target animals as compared with artifact targets. In addition, in line with the previous experiment, we failed to observe a significant precedence in reporting of animal targets in this experiment as well. Finally, we also failed to observe significantly more frequent erroneous reporting of animal distractors compared with artifacts. Despite the fact that each trial was structured to induce competition between animal and artifact attentional foci in the presence of hypothetically attention-grabbing animal distractors, we failed to reject any of the null hypotheses. In fact, the obtained BFs showed moderate evidence for the null hypothesis for the measures on accuracy and percentage of incorrect responses. However, the BF for response orders was only anecdotal, which does not warrant a firm conclusion on its support for the null hypothesis. Rather, the data appear insensitive in distinguishing between the null and the alternative hypothesis for the measure on response orders (Dienes, 2014). In summary, it appears that artifacts can be tracked just as efficiently as animals, and we found no conclusive evidence for prioritizations of either category, nor did we observe that animate objects were conclusively more effective in capturing attention in target–distractor confusions, even though the images were continuously visible. Again, researchers should decide whether the potential for even smaller effect sizes than what our study was designed for is deemed relevant.

Experiment 3

In the previous couple of experiments, the identity of the objects was mostly irrelevant to the task; thus in the following experiment, we made the identity of the objects explicitly relevant, using image probes during the response phase, while requiring participants to localize them. Previous work using such probes has indicated more successful tracking of object properties presumed able to induce attentional biases (i.e., attractive faces and emotional expressions: Jin & Xu, 2015; Li et al., 2016; Liu & Chen, 2012). Moreover, using such image probes during the response phase, we required participants to be aware, at all times, of which objects were tracked and where these were located. According to the animate monitoring hypothesis, this type of explicit requirement should not be necessary for observing a bias toward animals, as the bias is supposed to behave in an automatic way regardless of current goals. However, as the previous experiments failed to bring about a substantial advantage for animals, we reasoned that the situation constructed here could increase the chance of revealing such a bias. We expected that in such conditions, the binding and tracking of animal targets would be more successful than for artifacts. Thus, the task is similar to Experiments 1 and 2, except for making the appearance of the objects at assignment directly relevant for performance. Due to the extensive literature on category-specific deficits for animals in naming, recognition and memory (e.g., Capitani et al., 1994; Låg, 2005; Låg et al., 2006; Laws & Hunter, 2006; Laws & Neve, 1999; Nairne, VanArsdall, & Cogdill, 2017; Nairne, VanArsdall, Pandeirada, Cogdill, & LeBreton, 2013), it seems difficult to purely attribute an effect of superior identity tracking accuracy for animals as stemming from an attentional bias. Consequently, we designed for the acquisition of position accuracy measures as well, by making the task sufficiently difficult, so as to avoid ceiling effects. The design of this study allowed for investigating both identity tracking performance and position tracking performance. We defined identity accuracy as the percentage correct localizations of the probe images displayed at the bottom of the screen (see Figure 5). We defined position accuracy as the percentage correct localizations of targets, irrespective of their identities. The expectation was that the identity of animal targets would be tracked more successfully than the identity of artifact targets and, consequently, we expected their positions to be tracked more successfully as well. As in the previous experiments, we also predicted that animal distractors would be reported as targets more frequently than artifact distractors.

Figure 5.

Illustration of a trial in Experiment 3. First targets were assigned by enclosing them in red circles (a), then all objects started moving around the display (b) before being hidden at the last frame when the movements stopped (c). Probes appeared at the bottom of the display during the response phase (d), where participants indicated the position of the probes. Another version of this experiment, kept the objects hidden during tracking in an attempt to make the bindings between identity and positions more volatile. We included this experiment as Supplementary Experiment 2. We recruited 71 participants (21 women) with a mean age of 32.2 years (range: 18–57 years, SD = 8.3 years). We used the same set of images as in the previous experiments. This was similar to Experiment 2, with the exception that participants were required to indicate the position of the objects displayed at the bottom of the screen in the response phase (see Figure 5(d)). Identity accuracy was based on how accurately participants could localize the individual targets after the tracking period, while position accuracy was defined as how accurately target positions were reported irrespective of their identity. Each target was probed sequentially in a counterbalanced manner between animals and artifacts. The circles turned red when clicked on during the response phase. The incorrect and correct objects were revealed along with feedback about accuracy once the required number of objects had been reported. Before conducting the statistical analysis, we removed the data from five participants for having mean identity accuracy below 35% (1.5 SD below the median). A t test on identity accuracy showed no significant difference between animal (M = 59.4%, SD = 14.4%) and artifact (M = 57.4%, SD = 13.5%) targets, t(65) = 1.3, p = .19, 95% CI [−1, 5.1], d = 0.16, d = 0.14, CL = 56%. A Bayesian t test showed moderate evidence for the null hypothesis, BF01 = 3.238. A further t test on position accuracy showed no significant difference between animal (M = 71.2%, SD = 14.1%) and artifact (M = 70.7%, SD = 12.2%) targets, t(65) = 0.29, p = .78, 95% CI [−2.5, 3.4], d = 0.035, d = 0.032, CL = 51% (see Figure 6). A Bayesian t test showed moderate evidence for the null hypothesis, BF01 = 7.121.

Figure 6.

Combined bar and scatter plots for Experiment 3 on mean identity accuracy, position accuracy, and percentage of incorrect responses by category. Error bars show standard errors, and the superimposed scatterplots show mean values from each participant. Identity accuracy shows how accurately participants could localize the individual targets after the tracking period. Position accuracy shows percentage correct localizations of targets, irrespective of their identities. Before analyzing the percentage of incorrect responses on distractors by category, we removed two participants for having five or less incorrect responses. A t test between animals (M = 50.6%, SD = 6.8%) and artifacts (M = 49.4%, SD = 6.8%) showed no significant difference, t(63) = 0.74, p = .46, 95% CI [−2.1, 4.7], d = 0.092, d = 0.18, CL = 54%. A Bayesian t test showed moderate evidence for the null hypothesis, BF01 = 5.627. Although the images were continuously visible throughout tracking and participants were explicitly required to track their identities, we failed to observe statistically significant advantages for animal targets over artifact targets in identity and position accuracy as well as in percentages of incorrect responses. Consistently, the obtained BFs showed moderate evidence for the null hypothesis of no effect of images of animals. It thus seems unlikely that our measures were substantially different between the two types of images used. An alternative design could have required participants to locate the objects by name rather than image, which could in turn have promoted a strategy for encoding more semantic aspects of the objects. However, according to the animate monitoring hypothesis, explicit semantic processing of animals should not be required for obtaining an attentional advantage.

Experiment 4

Some MIT variants in previous studies with facial stimuli have used designs where targets and distractors are either from the same or from the different categories (Jin & Xu, 2015; Li et al., 2016, 2017; Liu & Chen, 2012). Such a design allows for the simultaneous testing of differences in the ability of the categories in holding and attracting attention. Moreover, this design allows participants to separate targets and distractors categorically in a subset of trials, which may relax the need to relay on object identity during tracking. While it may not be clear why this arrangement should be more sensitive to an attentional bias for animals than our previous attempts, our primary motivation was to use a design that has had history of successfully demonstrating biases to categories of objects. An underlying assumption in these studies is that the binding of an identity, which may have an associated attentional bias, to its position, should improve tracking performance of that position, as it moves around the display, independently of the explicit requirement of tracking its identity (Li et al., 2017). Despite this apparent assumption, most previous studies have focused on the acquisition of identity accuracy measures. In fact, the majority of studies with facial stimuli and identity probes did not analyze position accuracies due to ceiling effects (Li et al., 2016, 2017), but one study reported an advantage for fearful over neutral faces in both position tracking accuracy and identity tracking accuracy (Jin & Xu, 2015). Thus, we specifically designed the experiment to obtain position accuracies as well. With this setup, we predicted that animals would yield an advantage in both identity accuracy and position accuracy. Moreover, we predicted that animal distractors would lead to more errors than artifact distractors. We recruited 67 participants (17 women) with a mean age of 33 years (range: 18–67 years, SD = 10 years). We used the same set of images as in the previous experiments. Similar to Experiment 3, except that targets were either four animals or artifacts, while distractors were either eight animals or artifacts. Before performing the statistical analysis, we removed four participants for having mean identity accuracy below 35% (1.5 SD below the median). An analysis of variance (ANOVA) on identity accuracy over target category (animal, artifact) and distractor category (animal, artifact) revealed a significant main effect of target category, F(1, 62) = 5.15, p = .027, = .08, < .01. There was, however, neither a significant effect of distractor category, F(1, 62) = 1.34, p = .251, = .02, < .01, nor a significant interaction, F(1, 62) = 3.83, p = .055, = .06, < .01. Animal targets had a mean accuracy of 56.2% (SD = 13.6%), while artifact targets had a mean accuracy of 54.4% (SD = 14%). Trials with animal distractors had a mean accuracy of 55.6% (SD = 14%), while trials with artifact distractors had a mean accuracy of 54.9% (SD = 13.7%). A Bayesian repeated measures ANOVA on identity accuracy revealed anecdotal evidence that target category had an effect, BF01 = 0.49, and moderate evidence for the null hypothesis for distractor category, BF01 = 4.5. An ANOVA on position accuracy over target category and distractor category revealed a significant main effect of target category, F(1, 62) = 4.90, p = .030, = .07, < .01, but not distractor category, F(1, 62) = 0.83, p = .366, = .01, < .01. The interaction effect was also significant, F(1, 62) = 13.50, p < .001, = .18, < .01 (see Figure 7). Animal targets had a mean accuracy of 71.1% (SD = 12.6%), while artifact targets had a mean accuracy of 72.1% (SD = 12.6%). The reverse of the identity results; animal targets leading to reduced position accuracy performance. Trials with animal distractors had a mean accuracy of 71.9% (SD = 12.6%), while trials with artifact distractors had a mean accuracy of 71.4% (SD = 12.6%), implying that artifacts were slightly more effective distractors than animals. A Bayesian repeated measures ANOVA on position accuracy revealed anecdotal evidence for the null hypothesis for target category, BF01 = 1.97, as well as moderate evidence for the null hypothesis for distractor category, BF01 = 5.61.

Figure 7.

Combined bar and scatter plots for Experiment 4 on mean identity and position accuracy by target category and distractor category. Error bars show standard errors, and the superimposed scatterplots show mean values from each participant. Identity accuracy shows how accurately participants could localize the individual targets after the tracking period. Position accuracy shows percentage correct localizations of targets, irrespective of their identities. In line with predictions, the identity of animal targets was reported significantly more successfully than the identity of artifact targets. However, contrary to the prediction that a similar advantage should be found in position accuracy, the results showed that artifacts were tracked significantly more successfully than animals. Based on these tendencies, it would seem that participants were better at tracking the identity of animals but not their positions. Suggesting that participants were slightly better at remembering where they saw a particular animal. In addition, we found no significant effect of animal distractors, which is in line with previous studies (Li et al., 2016, 2017) as well as our previous experiments. The obtained BFs for identity accuracy were mostly in line with the significance tests. However, the BFs helped to reveal that the evidence for the alternate hypothesis of target category was only anecdotal (Wetzels et al., 2015). Thus, we cannot conclusively state that animal identities were tracked better than artifact identities. The BFs also helped to cast doubt on the statistically significant result of target category in position accuracy by showing that the null hypothesis was 1.97 times more likely than the alternate hypothesis given the data. Thus, the results indicated anecdotal evidence for no difference between the categories in position accuracy. Although the Bayesian results did not warrant any conclusion with regard to the effect of category on identity and position accuracies, it is still interesting to consider that the indicated patterns of results might not necessarily be attributed to attention. As the results indicated that participants were not better at tracking positions associated with animals but were better at remembering what they depicted, this might suggest an advantage in memory (Nairne et al., 2013, 2017) or encoding (Hagen & Laeng, 2017). Indeed, more effective encodings of animals from brief exposures (as implied by the brief target inspections occurring in such tasks, Oksama & Hyönä, 2016) might yield the indicated advantage in reporting where particular animals were localized. Specifically, a recent study with rapid presentations of animals (Hagen & Laeng, 2017) showed that animal targets were encoded more successfully for later report than artifacts but still did not gain prioritized access to attention. Finally, it must be stressed that the observed effect sizes in the present experiment are relatively small and far from what should be expected from the original account (New et al., 2007). In fact, the Bayesian analysis indicated that the effects of category on identity and position accuracies were weak and inconclusive. Thus, future studies should aim to test a larger sample if the potential for such small effects are deemed interesting and worthwhile.

Experiment 5

While the previous experiments largely failed to observe any clear attentional biases for animals, the animate monitoring hypothesis specifically proposed that the mechanism evolved to monitor the location and state of animate objects. Consequently, the features offered by the change detection task were deemed important by the original investigators of the hypothesis (New et al., 2007). The investigation thus far has probed more the aspect of keeping track of the changing positions of animals, but we have not yet assessed the importance of actually monitoring the state of objects. It is also possible that the original change detection design (New et al., 2007) lacked some dynamic aspects which the mechanism may be particularly sensitive to, considering that it evolved in a dynamic and noisy world (e.g., animals may be moving about, but only certain aspects of their translations in space is relevant for behavior). Another aspect of the change detection task is that it relies on disrupting visual processing by blanking the screen to mask changes in state, which may have unknown influences on the putative monitoring system. Thus, the concept of combining an MOT task with a change detection task appears to have merits worthy of an investigation despite the apparent lack of evidence so far from either type of paradigms (e.g., Hagen & Laeng, 2016). Such combinations have been attempted in unrelated investigations (Bahrami, 2003; Oksama & Hyönä, 2008), relying on invasive disruptions of visual processing (blanking and mud splashes). A recent development, however, is the MEM task, where participants are required to continuously monitor the state of multiple objects moving randomly around a display (Wu & Wolfe, 2016). The task is thus similar to the MOT or MIT tasks, with the notable exception that participants are to monitor all objects for a specific change and respond as fast as possible when a change occurs. Importantly, in this paradigm, the objects are continuously visible as changes in state occur. Because changes in state can induce visual transients drawing attention to their location, the paradigm relies on small clockwise and counterclockwise rotations, of each stimulus throughout the duration of the tracking period, to mask the signal from a single transitory change in state, enforcing participants to pay close attention to the state of objects rather than relying on a transient signal from one of the objects. In the version of the task presented here, we changed the state of objects by manipulating their lateral (horizontal) orientation at a random point in time. This task thus combines the continuous distributed and dynamic attention aspects of the MOT or MIT paradigms with a change detection task requiring participants to respond as fast as possible when they detect a change in state of animals and artifacts. This task should thus be more similar to the type of task thought to be sensitive to the attentional bias for animals (New et al., 2007), that is, engaging active monitoring of the location and state of animals while imposing vigilance toward responding to changes that are relevant for behavior. The main prediction was that participants should detect changes to animals faster and more correctly than changes to artifacts. In addition, we used two levels of load, as previous research with this paradigm (Wu & Wolfe, 2016) has indicated that only about two to three objects can be tracked successfully (Wu & Wolfe, 2016); we decided to have trials with two or four objects. If humans’ typical tracking capacity is two to three objects, then tracking four objects should presumably help in bringing forward an advantage for animals, especially if these are prioritized in attention. For Experiment 5A, we recruited 60 participants (24 women) with a mean age of 31 years (range: 17–64 years, SD = 9.75 years). For Experiment 5B, we recruited 55 participants (8 women) with a mean age of 30 years (range: 17–67 years, SD = 10.7 years). For experiment 5A we selected 20 animals (alligator, ant, bear, cow, donkey, fish, fly, frog, gorilla, horse, kangaroo, lobster, monkey, mouse, penguin, pig, rabbit, rooster, seal, and snake) and 20 artifacts (airplane, baby carriage, bicycle, three cars, church, digger, gun, rocking chair, scooter, stroller, Swiss army knife, teapot, telescope, tractor, triangle ruler, trumpet, watering can, and whistle) that we judged to have clear directionality across six sets of line drawings (Bates et al., 2003; Bonin, Peereman, Malardier, Méot, & Chalard, 2003; Cycowicz, Friedman, Rothstein, & Snodgrass, 1997; Op de Beeck & Wagemans, 2001; Nishimoto, Miyawaki, Ueda, Une, & Takahashi, 2005; Snodgrass & Vanderwart, 1980). The groups of objects were matched on degree of visual change they would induce by matching them on the number of pixels that would vary when they changed orientation from left to right, t(37.509) = 0.21, p = .83, as well as number of pixels (size), t(36.168) = 0.32, p = .75. For Experiment 5B, we selected, from the same sources as Experiment 5A, 20 animals (ant, bird, bug, bull, cat, deer, dove, fawn, lion cub, monkey, mouse, raccoon, rhinoceros, sea horse, seagull, seal, spider, turtle, and two wolves) and 20 artifacts (ambulance, beer mug, camper, canon, car, caravan, two drills, flipper, grenade, hook, key, mixer, pipe, saw, stapler, traffic light, tricycle, violin, and wheelbarrow). The stimuli were also matched on how much they would change by being flipped (mirror flipped horizontally), t(37.648) = 0.13, p = .89, and number of pixels, t(34.41) = 0.28, p = .77. Each trial started with the presentation of a set of objects for 3 seconds. Next, the movement phase started and the objects moved randomly around the screen for 8 seconds (see Figure 8), or until response. The objects moved with a speed of 8 pixels per frame (30 frames per second) in a display measuring 800 × 800 pixels. Objects were also randomly tilted by 30° to the left or right for short durations (233 milliseconds) in order to mask any unique transients imposed by the change in orientation of the target object (Wu & Wolfe, 2016). Participants were instructed to look for a change in lateral orientation in any of the objects and press the space bar, as quickly as possible, to indicate that they had detected a change in orientation. Black disks immediately occluded the objects once the button had been pressed. Participants were then instructed to indicate the target by clicking on it with the mouse pointer. If they failed to press the button before 8 seconds had elapsed, the trial ended with the objects occluded and the participant guessed which one had changed. A change in orientation always happened within the time range of 2 to 6 seconds. Thus, there was a sufficient time to detect a change before the trial ended. Each trial contained an equal number of animals and artifacts. The experiment included 80 trials, divided over target category (animal, artifact) and number of objects (load 2, load 4). All trial movements were randomly generated for each participant, but movement patterns were matched between the target categories. Participants were required to complete six practice trials where at least four were correct and responded to within 2 seconds of the change.

Figure 8.

Illustration of the procedure in Experiment 5. First, targets were assigned (a), then all objects started moving around the display while frequently rotating 30° left and right (b), a change in lateral orientation occurred at a random time point between 2 and 6 seconds from start of tracking (c; notice that the church changes lateral orientation, as signaled by the position of the belfry). The movements stopped once a total of 8 seconds had elapsed or the observer pressed a button. Black disks immediately occluded the objects as the movements stopped (d). Participants then indicated the position of the changed object (e). Images adapted with permission (Snodgrass & Vanderwart, 1980, pp. 197–204).

Experiment 5A

To analyze accuracy, we ran an ANOVA on Category (animal, artifact) and Load (Load 2, Load 4), which showed significant main effects of Category, F(1, 59) = 12.71, p < .001, = .18, = .03, and Load, F(1, 59) = 149.29, p < .001, = .72, = .43. The interaction of the two factors was also significant, F(1, 59) = 13.91, p < .001, = .19, = .03. At Load 2, both categories had near ceiling performance, as animals had a mean accuracy of 98.2% (SD = 3.62%) and artifacts had a mean accuracy of 98.1% (SD = 3.03%), t(59) = 0.15, p = .88, 95% CI [ − 0.83, 0.97], d = 0.019, d = 0.02, CL = 51%. At Load 4, animals had a mean accuracy of 85% (SD = 11.6%), while artifacts had a mean accuracy of 78.3% (SD = 14.4%), t(59) = 3.8, p = .00039, 95% CI [3.1, 10], d = 0.49, d = 0.5, CL = 69% (see Figure 9). A Bayesian repeated measures ANOVA revealed anecdotal evidence for the alternate hypothesis for Category, BF01 = 0.84, and extreme evidence for the alternate hypothesis of Load, .

Figure 9.

Mean accuracy and RTs in Experiment 5A. Error bars show standard errors.

Mean accuracy and RTs in Experiment 5A. Error bars show standard errors. For the analysis of response times (RTs), we selected trials in which a correct response was made within 2,000 milliseconds from a change in orientation. An ANOVA on RT over Category (animal, artifact) and Load (Load 2, Load 4) showed significant main effects of Category, F(1, 59) = 20.48, p < .001, = .26, = .03, and Load, F(1, 59) = 230.86, p < .001, = .80, = .37. Their interaction was not significant, F(1, 59) < 0.01, p = .996, < .01, < .01. At Load 2, animals had a mean RT of 749.7 milliseconds (SD = 124.1 milliseconds), while artifacts had a mean RT of 808 milliseconds (SD = 132.7 milliseconds), t(59) = 5.5, p < .001, 95% CI [−80, − 37], d = 0.71, d = 0.45, CL = 76%. At Load 4, animals had a mean RT of 984.2 milliseconds (SD = 174.8 milliseconds), while artifacts had a mean RT of 1,042 milliseconds (SD = 182 milliseconds), t(59) = 2.5, p = .013, 95% CI [−100, −13], d = 0.33, d = 0.33, CL = 63%. A Bayesian repeated measures ANOVA revealed moderate evidence for the alternate hypothesis for Category, BF01 = 0.272, and extreme evidence for alternate hypothesis of Load, .

Experiment 5B

Before the analysis, we removed three participants for having mean accuracy below 65% (1.5 SD below the median). For the analysis of accuracy, we ran an ANOVA on Category (animal, artifact) and Load (Load 2, Load 4), which showed significant main effects of Category, F(1, 51) = 8.27, p = .006, = .14, = .02, and Load, F(1, 51) = 175.82, p < .001, = .78, = .53. Their interaction was also significant, F(1, 51) = 4.24, p = .045, = .08, < .01. At Load 2, animals had a mean accuracy of 98% (SD = 3.1%), while artifacts had a mean accuracy of 96.9% (SD = 4.57%), t(51) = 1.3, p = .19, 95% CI [−0.56, 2.8], d = 0.19, d = 0.29, CL = 57%. At Load 4, animals had a mean accuracy of 78.1% (SD = 15.6%), while artifacts had a mean accuracy of 72.8% (SD = 13%), t(51) = 2.7, p = .0093, 95% CI [1.3, 9.1], d = 0.37, d = 0.36, CL = 65% (see Figure 10). A Bayesian repeated measures ANOVA revealed anecdotal evidence for the null hypothesis for Category, BF01 = 2.145, and extreme evidence for the alternate hypothesis for Load, .

Figure 10.

Mean accuracy and RTs in Experiment 5B. Error bars show standard errors.

Mean accuracy and RTs in Experiment 5B. Error bars show standard errors. An ANOVA on RT over Category (animal, artifact) and Load (Load 2, Load 4) showed that the main effect of Category was not significant, F(1, 51) = 0.48, p = .491, < .01, < .01, and that the main effect of Load was, however, significant, F(1, 51) = 150.07, p < .001, = .75, = .36. Their interaction was not significant, F(1, 51) = 1.00, p = .322, = .02, < .01. At Load 2, animals had a mean RT of 788.7 milliseconds (SD = 117.3 milliseconds), while artifacts had a mean RT of 813.5 milliseconds (SD = 116.1 milliseconds), t(51) = 1.8, p = .07, 95% CI [−52, 2.1], d = 0.26, d = 0.21, CL = 60%. At Load 4, animals had a mean RT of 1,036 milliseconds (SD = 183.7 milliseconds), while artifacts had a mean RT of 1,033 milliseconds (SD = 198.9 milliseconds), t(51) = 0.098, p = .92, 95% CI [−51, 56], d = 0.014, d = 0.014, CL = 51%. A Bayesian repeated measures ANOVA revealed moderate evidence for the null hypothesis for Category, BF01 = 5.62, and extreme evidence for the alternate hypothesis for Load, . The results from Experiment 5A revealed that changes to animals were reported significantly more accurately and faster than changes to artifacts. This seems to be in agreement with the animate monitoring hypothesis (New et al., 2007) and to bring support to the idea that the act of monitoring objects for changes is an important aspect for observing a bias for animals. However, there is a possibility that the animal stimuli were somehow easier to monitor for changes than artifacts due to some uncontrolled factors pertaining to the chosen set of images. It is thus interesting to consider the results from Experiment 5B, which used a different set of images. Similar to the previous experiment, this experiment also appeared to reveal significantly more accurate reporting of changes to animals as compared with artifacts, but it did not replicate the observation of faster detections of animal changes. While the BF for the effect of category on accuracy was in agreement with the significance results of Experiment 5A, it did not agree with the significance results of Experiment 5B. The significance tests of Experiment 5B showed that animals were tracked significantly more accurately than artifacts, while the BF showed anecdotal evidence for the null hypothesis for the same data. Given this set of results and the fact that we have only anecdotal evidence for the alternate and null hypothesis across experiments (5A, 5B), the results on the effect of category on accuracy appear inconclusive. The evidence for an effect of category in RT was in fact moderate in both experiments, with Experiment 5A showing evidence for the alternate hypothesis while Experiment 5B showing evidence for the null hypothesis. Thus, we are here faced with a conflicting set of results. In summary, both experiments were inconclusive in relation to an effect of category on accuracy. Experiment 5A showed moderate evidence for an effect of category on RT, while Experiment 5B showed moderate evidence for no effect of category on RT. Specifically, the effect of images of animals on accuracy and RT would appear not to be robust or of considerable size, as well as appearing to be dependent on the stimuli used. While it is possible that humans are more sensitive to changes in lateral orientation of animals, such a requirement appears too specific in relation to the general advantage for animals we are seeking to find (New et al., 2007). In addition, we cannot rule out the effect some uncontrolled low-level aspects that somehow made the monitoring of the animals’ lateral orientation easier (e.g., that a protruding head and neck pointing in a certain direction could be easier to detect to have changed than objects not suggesting such directionality). Thus, it would seem appropriate to attempt to generalize the indications observed here to another type of change. Another type of change that might seem even more relevant in a survival scenario is changes in size. A change in size would intuitively signal that an animal is either getting closer or further away from the viewer, a situation which intuitively should be more relevant for survival than animals turning left and right. Thus, the next experiment was designed to directly address whether the advantage is dependent on the type of change participants were monitoring for.

Experiment 6

To assess whether the indication of an advantage for monitoring animals in Experiment 5 is specific to lateral (horizontal) orientation changes or can be generalized to another type of change that would be perhaps even more relevant in a survival scenario, we selected changes in size as another type of change to monitor for. Intuitively, changes in size provide visual cues for apparent distance of an object from an unmoving viewer so that a size-changing object may appear to move in depth during the change. As for the previous experiment, it is predicted that changes to animals would be detected more accurately and faster than changes to artifacts. Similar to the previous experiment, we also chose to conduct a replication in a parallel experiment (6B) with a different set of images. If the advantage for animals in Experiment 5 was related to an animate monitoring bias, then we would expect to find analogous effects in the present experiment. Conversely, if the animal advantage was specific to lateral changes and not related to an animate monitoring advantage, we expected no marked advantage for animals. In addition, we checked the post hoc hypothesis that increases in size should cue for increased proximity (e.g., looming; Schiff, Caviness, & Gibson, 1962) and thereby be more salient and pertinent to yielding an animal advantage than decreases in size. For Experiment 6A, we recruited 71 participants (10 women) with a mean age of 28 years (range: 18–58 years, SD = 7.49 years). For Experiment 6B, we recruited 64 participants (13 women) with a mean age of 31.2 years (range: 18–63 years, SD = 8.71 years). For Experiment 6A, we used the same set of images as in Experiment 5A. As this set was balanced on the number of pixels that would change in lateral inversions, it was not balanced on the amount of change introduced by changing their sizes. Thus, we resized the images such that the mean overall size, t(37.98) = 0.03, p = .973, number of dark pixels (darker than 0.6 on a gray scale from 0 to 1), t(33.5) = 0.81, p = .4, and amount of change in overall pixels, t(37.95) = 0.05, p = .957, and dark pixels, t(30.224) = 1.07, p = .3, introduced by scaling them 25% would not be significantly different between animals and artifacts. Part of this process involved matching individual images of animals and artifacts on size and amount of change such that they would be as similar as possible on those measures. The resulting pairs were then used in trial generation in which 40 unique sets of movement patterns (20 sets with two objects and 20 sets with four objects) were randomly selected for each participant from a pool of 240 paths. Then, for generating trials, an equal number of animals and artifacts were assigned to each set of movement patterns before target animals, change times, initial size (±12.5%), and type of change (implied by initial size, as initially small objects would increase in size while initially large objects would shrink in size by 25%) were assigned in a counterbalanced manner. For the artifact trials, we simply duplicated the animal trials to create another set of 40 trials but replaced animals with artifacts and artifacts with animals. To make the changes in size less conspicuous, we also changed the background to white (see Figure 11).

Figure 11.

Example of a trial in Experiment 6. First, targets were assigned (a), then all objects started moving around the display while frequently rotating 30° left and right (b), a change in size occurred (larger or smaller) at a random time point between 2 and 6 seconds from start of tracking (c; notice that the teapot increases in size). The movements stopped once a total of 8 seconds had elapsed or the observer pressed a button. The objects were immediately occluded behind black disks as the movements stopped (d). Participants then indicated the location of the changed object (e). Image of rabbit adapted with permission (Snodgrass & Vanderwart, 1980, pp. 197–204). For Experiment 6B, we selected, from a pool of 61 animals and 80 artifacts, 20 new pairs of animals (bear, beetle, bug, bull, camel, cow, duck, kangaroo, moose, mouse, parrot, rabbit, raccoon, seahorse, shark, snail, swan, turtle, and two wolves) and artifacts (bathtub, bicycle, flipper, harmonica, kitchen knife, kite, lamp, light bulb, motorbike, mousetrap, purse, rocking chair, skateboard, stroller, table, teapot, telescope, tractor, violin, and watering can). The new set were chosen, as it was more appropriate for balancing on visual properties deemed important with regard to size changes (number of pixels changing when resized; in contrast, the set selected for, e.g., Experiment 5B was balanced on number of pixels that changed when flipped left and right). The images were resized to be minimally different in number of overall pixels, t(38) = 0.02, p = .982, dark pixels, t(38) = 0.06, p = .95, and amount of change in overall pixels, t(38) = 0.02, p = .985, and dark pixels, t(37.975) = 0.015, p = .99, introduced by scaling them 25%. The trial generation process was identical to Experiment 6A. The overall procedure was identical to Experiment 5, with the exception that participants were instructed to report when one of the objects changed size. The change in size was either 25% smaller or larger.

Experiment 6A

Before analyzing the data, we removed three participants for having mean accuracy below 65% (1.5 SD below the median). An ANOVA on Category (animal, artifact) and Load (Load 2, Load 4) showed a nonsignificant main effect of Category, F(1, 67) = 0.11, p = .742, < .01, < .01, but a significant main effect of Load, F(1, 67) = 535.97, p < .001, = .89, = .62. The interaction was not significant, F(1, 67) = 0.40, p = .530, < .01, < .01. At Load 2, animals had a mean accuracy of 95.6% (SD = 5.51%), while artifacts had a mean accuracy of 94.6% (SD = 5.46%), t(67) = 1.2, p = .23, 95% CI [ − 0.66, 2.7], d = 0.15, d = 0.18, CL = 56%. At Load 4, animals had a mean accuracy of 69.5% (SD = 13.6%), while artifacts had a mean accuracy of 69.8% (SD = 12.6%), t(67) = 0.15, p = .88, 95% CI [−4.1, 3.6], d = 0.018, d = 0.022, CL = 51% (see Figure 12). A Bayesian repeated measures ANOVA revealed moderate evidence for the null hypothesis for Category, BF01 = 7.31, and extreme evidence for the alternative hypothesis for Load, .

Figure 12.

Mean accuracy and RTs in Experiment 6A. Error bars show standard errors.

Mean accuracy and RTs in Experiment 6A. Error bars show standard errors. An ANOVA on RT showed a nonsignificant main effect of Category, F(1, 67) = 0.14, p = .707, < .01, < .01, and a significant main effect of Load, F(1, 67) = 66.66, p < .001, = .50, = .12. The interaction was not significant, F(1, 67) = 0.02, p = .883, < .01, < .01. At Load 2, animals had a mean RT of 717.1 milliseconds (SD = 121.7 milliseconds), while artifacts had a mean RT of 710.8 milliseconds (SD = 122.1 milliseconds), t(67) = 0.49, p = .63, 95% CI [−19, 32], d = 0.059, d = 0.052, CL = 52%. At Load 4, animals had a mean RT of 820.9 milliseconds (SD = 172.7 milliseconds), while artifacts had a mean RT of 817.6 milliseconds (SD = 159.9 milliseconds), t(67) = 0.17, p = .86, 95% CI [−35, 41], d = 0.021, d = 0.02, CL = 51%. A Bayesian repeated measures ANOVA revealed moderate evidence for the null hypothesis for Category, BF01 = 7.0, and extreme evidence for the alternative hypothesis for Load, .

Experiment 6B

Before conducting the analysis, we removed six participants for having mean accuracy below 65%. An ANOVA on accuracy over Category (animal, artifact) and Load (Load 2, Load 4) showed a nonsignificant main effect of Category, F(1, 57) = 1.38, p = .245, = .02, < .01, but a significant main effect of Load, F(1, 57) = 464.36, p < .001, = .89, = .69. The interaction was not significant, F(1, 57) = 0.32, p = .576, < .01, < .01. At Load 2, animals had a mean accuracy of 95.1% (SD = 4.87%), while artifacts had a mean accuracy of 94.5% (SD = 5.17%), t(57) = 0.72, p = .48, 95% CI [−1.1, 2.3], d = 0.094, d = 0.12, CL = 54%. At Load 4, animals had a mean accuracy of 68% (SD = 10.9%), while artifacts had a mean accuracy of 66.1% (SD = 13.3%), t(57) = 0.94, p = .35, 95% CI [−2.1, 5.8], d = 0.12, d = 0.15, CL = 55% (see Figure 13). A Bayesian repeated measures ANOVA revealed moderate evidence for the null hypothesis for Category, BF01 = 6.1, and extreme evidence for the alternate hypothesis for Load, .

Figure 13.

Mean accuracy and RTs in Experiment 6B. Error bars show standard errors.

Mean accuracy and RTs in Experiment 6B. Error bars show standard errors. Similarly, an ANOVA on RT failed to reach significance for the main effect of Category, F(1, 57) = 0.78, p = .382, = .01, < .01, but reached significance for the main effect of Load, F(1, 57) = 117.22, p < .001, = .67, = .11. The interaction was also nonsignificant, F(1, 57) = 0.07, p = .786, < .01, < .01. At Load 2, animals had a mean RT of 717 milliseconds (SD = 132.9 milliseconds), while artifacts had a mean RT of 738 milliseconds (SD = 165.7 milliseconds), t(57) = 1.6, p = .13, 95% CI [−48, 6], d = 0.2, d = 0.13, CL = 58%. At Load 4, animals had a mean RT of 839.2 milliseconds (SD = 154.1 milliseconds), while artifacts had a mean RT of 851.4 milliseconds (SD = 225 milliseconds), t(57) = 0.38, p = .71, 95% CI [−77, 53], d = 0.05, d = 0.063, CL = 52%. A Bayesian repeated measures ANOVA revealed moderate evidence for the null hypothesis for Category, BF01 = 4.8, and extreme evidence for the alternate hypothesis for Load, .

Additional analysis

To assess the hypothesis that an animal’s increasing size should be particularly pertinent to yield an advantage, we ran an ANOVA on accuracy over Experiment (6A, 6B), Load (Load 1, Load 2), Category (animal, artifact), and Change (larger, smaller). Only the effect of Load reached significance, F(1, 124) = 1,008.26, p < .001, = .89, = .51. For brevity, we mention only the most relevant nonsignificant effects here (all other ps > .16); effect of Category, F(1, 124) = 1.32, p = .253, = .01, < .01, effect of Change, F(1, 124) = 0.91, p = .341, < .01, < .01, and interaction between Category and Change, F(1, 124) = 1.07, p = .304, < .01, < .01. A Bayesian repeated measures ANOVA showed strong evidence for the null hypothesis of Category, BF01 = 11.1 and Change, BF01 = 11.8. Finally, we ran the same ANOVA on RTs. Only the effect of Load reached significance, F(1, 119) = 156.77, p < .001, = .57, = .08. None of the other effects were significant; effect of Category, F(1, 119) = 0.13, p = .715, < .01, < .01, effect of Change, F(1, 119) = 0.56, p = .457, < .01, < .01, and interaction between Category and Change, F(1, 119) = 0.13, p = .723, < .01, < .01 (all other ps > .13). A Bayesian repeated measures ANOVA showed strong evidence for the null hypothesis of Category, BF01 = 12.4, and Change, BF01 = 10.3. None of the observations we had made in Experiment 5 were replicated. In these new experiments, animals were not reported to change significantly more accurately or faster than artifacts in either experiments. This was further confirmed by the obtained BFs which showed moderate evidence for the null hypothesis, supporting the conclusion that there was no effect of category on both accuracy and RTs. Moreover, the combined additional analysis revealed strong evidence for the null hypothesis. It thus appears that specific types of visual transformation can influence how well the stimuli used can be monitored for changes. As Experiments 5A and 6A used the same set of images, the advantage cannot be attributed to a particular set of images being easier to monitor for changes in general. Thus, changes in lateral orientation would seem more relevant for the hypothesized animate monitoring system than changes in size (or depth). Such a conclusion is, however, puzzling, as a change in size would intuitively signal that an animal is either getting closer or further away from the viewer and it would seem that this should be a more salient event (in survival terms) than seeing the same animal changing its direction to the left or right. Indeed, this consideration leads us to think that the results of Experiment 5 might have resulted from some low-level features’ changes that were more noticeable in the lateral orientation of animals. In addition, while it would seem intuitive that animals getting closer should be particularly capable in grabbing attention, we found no support for this. In relation to all, the null results of our previous experiments, as well as the findings with other paradigms, also showing no special role for animals in attention (Hagen & Laeng, 2016, 2017), it seems appropriate to suggest that what we observed in Experiment 5 is more likely to be related to aspects that are not directly related to a general prioritization in attentional processes (e.g., uncontrolled low-level features). Therefore, considering the conflicting results of Experiment 5 in context of the available evidence, we are inclined to conclude that Experiment 5 does not provide strong or conclusive evidence in favor of the animate monitoring mechanism. An interesting observation from the above experiments is that accuracy appears to be lower in combination with faster RTs as compared with Experiment 5. This could indicate that when changes in size were noticed, this occurred quite rapidly, while changes in lateral orientations appear to require more processing of the visual features for some length of time after the event, as reflected in longer RTs and higher accuracy. We should note that at present the MEM task is not a time-honored, standard, task in the cognitive sciences, thus not much is known about its limits and caveats. More research is needed to better understand the nature of the task and whether accuracy and RTs are actually indicative of attentional prioritizations of the depicted items. Such research may throw light on whether findings like those of Experiment 5 is likely to be a false positive in relation to studying the proposed innate attentional bias for animal stimuli, or conversely, the present experiment is likely to be a false negative.

Omnibus Analysis

To provide an overview of all results obtained in this study and make some principled conclusion, we conducted a final omnibus analysis; whereas several of our experiments failed to reach significance or provide conclusive evidence, they could still provide more robust indications of an animal or artifact bias when taken together. Although we based sample sizes on power-based statistical inference, one cannot exclude the possibility that the present experiments could have been underpowered, especially if there is a true effect that is smaller than the effect size originally estimated in the power analysis. Importantly, nonsignificant results are mostly inconclusive when considered alone. Thus, a more stringent way to reach a conclusion is to consider multiple experiments simultaneously, as we attempt to do next. Although it is statistically impossible to support an effect size of exactly zero (Lakens, 2017), it is possible to use an equivalence test (Lakens, 2017) to reject effect sizes above a specified limit by showing that our estimated effect sizes are statistically smaller than a specified equivalence bound (e.g., [−0.4, 0.4]). However, in absence of a concretely defined theoretical limit for a minimum effect size, we decided to set equivalence bounds to the range of effect sizes that we estimated to have 80% power to find equivalence for (Lakens, 2017) by simulating meta-analyses with heterogeneity similar to our combined experiments. In addition, to provide some context, we sought to quantify the effect of animal targets relative to another known effect in the field. Specifically, we compared it with the effect of increasing the number of targets (tracking load) in a standard MOT task. Previous work in our lab (Alnæs et al., 2014) has shown that either adding or removing one target leads to an average change in tracking performance of about 5.8%. To more specifically investigate the effect of category on tracking and monitoring accuracy, we ran an omnibus ANOVA across all experiments, which is similar to running a meta-analysis of raw mean differences (Bond, Wiitala, & Richard, 2003). More specifically, we combined the accuracy measures from Experiment 1 to Experiment 2 and Supplementary Experiments 1 and 3, the position accuracy measures from Experiment 3 to Experiment 4 and Supplementary Experiment 2, and the accuracy measures from Experiment 5 and 6. This omnibus analysis revealed significant main effects of Experiment, F(10, 647) = 30.78, p < .001, = .32, = .29 and Category, F(1, 647) = 6.13, p = .014, = .0093, = 0.0013. The interaction was also significant, F(10, 647) = 1.96, p = .036, = .03, = 0.0042. Equivalence testing on these measures showed that our meta-analytic effect size was not statistically smaller than a d of 0.16, p = .12. In addition, a Bayesian repeated measures ANOVA revealed only anecdotal and inconclusive evidence for the null hypothesis for Category, BF01 = 1.077, and extreme evidence for the alternative hypothesis of Experiment, . Animal targets had a mean accuracy of 78.4% (SD = 12.4%), while artifacts had a mean of 77.7% (SD = 11.9%). This is an advantage for animal targets of 0.71%, which amounts to a load-relative performance change of 12%, comparable to tracking about 0.12 more objects in a standard MOT task. Next, we inspected Cook’s distances for the individual effect sizes across experiments and found that measures in Experiments 5A and 5B were likely to have comparatively large impacts on our result. As discussed previously, we have raised doubts on the appropriateness of these experiments, considered in relation to their weak evidential value and conflicting results, the null results of Experiment 6, and the general set of results. Thus, to further assess the weight of Experiment 5 in reaching a significant result in our omnibus analysis, we removed it from the analysis. Running the same analysis without Experiment 5 showed a significant main effect of Experiment, F(8, 537) = 16.10, p < .001, = .19, = .17, but not for Category, F(1, 537) = 0.37, p = .54, = .0007, = .00008 or the interaction, F(8, 537) = 0.67, p = .72, = .01, = .001. Equivalence testing showed that our meta-analytic effect size was statistically smaller than d = 0.145, p = .002, indicating that we can reject effect sizes larger than 0.145. A Bayesian repeated measures ANOVA revealed strong evidence for the null hypothesis for Category, BF01 = 12.45, and extreme evidence for the alternate hypothesis for Experiment, . Animal targets had a mean accuracy of 76% (SD = 11.9%), while artifacts had a mean of 75.8% (SD = 11.7%). This is an advantage for animal targets of 0.2%, which is equivalent to tracking about 0.03 more objects in a standard MOT task. Next, we assessed identity accuracy across Experiments 3 and 4 and Supplementary Experiment 2. This showed significant main effects for Experiment, F(2, 187) = 18.67, p < .001, = .17, = .15, and Category, F(1, 187) = 6.29, p = .013, = .03, = .003, indicating that animal identities were reported more accurately than artifacts. The interaction was not significant, F(2, 187) = 0.24, p = .787, < .001, < .001. Equivalence testing showed that we cannot find statistical support for meta-analytic effect sizes smaller than d = 0.234, p = .36. A Bayesian repeated measures ANOVA revealed anecdotal evidence for the alternate hypothesis for Category, BF01 = 0.419, and extreme evidence for the alternate hypothesis of Experiment, . Animal targets had a mean identity accuracy of 61.5% (SD = 14.8%), while artifacts had a mean of 59.9% (SD = 14.8%). This is an advantage for animal targets of 1.6%, which can be compared to tracking about 0.27 more objects in a standard MOT task. When we assessed the same experiments for position accuracy, we found a significant main effect of Experiment, F(2, 187) = 16.22, p < .001, = .15, = .13, but no significant effect of Category, F(1, 187) = 0.03, p = .866, = 0.00015, < .0001, neither was the interaction significant, F(2, 187) = 1.05, p = .351, = .01, < .001. Equivalence testing showed that we can reject meta-analytic effect sizes larger than d = 0.273, p = .03. A Bayesian repeated measures ANOVA revealed moderate evidence for the null hypothesis for Category, BF01 = 8.96, and evidence for the alternate hypothesis for Experiment, . Animal targets had mean position accuracy of 74.5% (SD = 13.1%), while artifacts had a mean of 74.4% (SD = 12.5%). This is an advantage for animal targets of 0.1%, which is equivalent to tracking about 0.02 more objects in a standard MOT task. We also ran an omnibus ANOVA on percentages of incorrect responses in Experiments 1 to 4 and Supplementary Experiments 1, 2, and 3. This analysis showed nonsignificant main effects of Experiment, F(6, 408) < 0.001, p > .999, < .001, < .001, and Category, F(1, 408) = 0.06, p = .80, < .001, < .001. The interaction was also nonsignificant, F(6, 408) = 0.38, p = .89, < .01, < .01. Equivalence testing on this measure showed that we can reject meta-analytic effect sizes larger than d = 0.149, p = .003. A Bayesian repeated measures ANOVA revealed strong evidence for the null hypothesis for Category, BF01 = 11.1, and Experiment, BF01 = 2,592.3. Finally, we conducted an omnibus ANOVA on response orders across Experiments 1 to 2 and Supplementary Experiments 1 and 3. This analysis revealed a significant main effect of Experiment, F(3, 226) = 651.2, p < .001, = .90, = .87, and a nonsignificant main effect of Category, F(1, 226) = 1.35, p = .25, = .006, = .002. The interaction was also not significant, F(3, 226) = 0.46, p = .71, = .006, = .002. Equivalence testing showed that we can reject meta-analytic effect sizes larger than d = 0.20, p = .02. A Bayesian repeated measures ANOVA revealed moderate evidence for the null hypothesis for Category, BF01 = 4.8, and extreme evidence for the alternate hypothesis for Experiment, . In more concrete terms, ignoring averages per participant, we obtained 107,374 valid responses from 658 participants across all experiments (e.g., excluding responses before a change occurred in the MEM tasks). Of these, 40,620 were correctly reported animals and 40,457 were correctly reported artifacts (ignoring identity accuracies). This amounts to a difference in only 163 responses. Thus, across 81,077 human attentional engagements with animal stimuli, just 0.2% of them represent our overall observed bias in tracking and monitoring accuracy (or 0.67% by exchanging position accuracy for identity accuracy). A similar count on incorrect responses revealed that 11,790 animal distractors and 11,789 artifact distractors were incorrectly reported as targets, a difference of only one response. Based on the above set of analyses, we cannot conclude that animals had a sizable influence on performance across tasks. Although we cannot entirely conclude that animals had no effect across tasks either, it turned out that in a majority of our tests, we got moderate to strong evidence for no effect of animals, while only inconclusive evidence for an effect in omnibus analyses that involved Experiment 5 and identity accuracy. Hence, it seems difficult to interpret these results as robustly supporting the animate monitoring hypothesis. All in all, we found evidence for no effect of images of animals in omnibus ANOVAs on tracking and monitoring accuracy (depending on the exclusion of Experiment 5), percentages of incorrect responses, and response orders. In fact, we found support for effect sizes smaller than what we had 80% power to find equivalence for. However, our data were not sensitive enough to conclude whether animals had an influence on identity accuracy or not. If the indicated effect sizes are deemed interesting and worthwhile, future studies should aim to include more participants than in this study as well as reassessing the validity of the design of Experiment 5.

General Discussion

On the basis of the animate monitoring hypothesis (New et al., 2007), we have used several versions of the MOT, MIT, and MEM tasks to investigate (a) whether images of animals can improve position and identity tracking, (b) whether they can act as more effective distractors, (c) whether they are selected prior to artifacts in the response phase, and (d) whether they are easier to monitor for changes. In the first three experiments with MOT and MIT tasks, we failed to reject the null hypothesis of no advantage or bias for animal targets and distractors. In fact, we found evidence in support of the null hypothesis. In Experiment 4, however, we did uncover a significant advantage for animal targets in identity accuracy but not in position accuracy. Remarkably, we observed the opposite: a significant advantage for artifacts. A Bayesian test did, however, show that the evidence for such effects was only weak and inconclusive. In the following two experiments (5A, 5B), instructing participants to monitor for lateral orientation changes, seemingly uncovered a pattern of results consistent with the animate monitoring hypothesis. As animals were reported significantly more accurately than artifacts in both experiments, and animal targets were reported significantly faster than artifacts in Experiment 5A. However, a Bayesian test did not support such a conclusion, as the evidence for an effect of category on accuracy turned out to only be anecdotal in both experiments. The effect of category on RT was also conflicting, as the first experiment (5A) showed moderate evidence for the alternate hypothesis while the latter (5B) showed moderate evidence for the null hypothesis. To follow up on these observations and to attempt to rule out that the indications of an advantage was specific to the type of change used, we substituted the lateral orientation changes with changes in size (as an index of proximity). This substitution appeared to be crucial for an advantage for animals in accuracy and RTs, as we consistently found moderate evidence for the null hypothesis of no effect of category. In fact, this pattern of results lends credibility to the interpretation that the results of Experiment 5 could represent a potential false positive in relation to an attentional bias for animals. More specifically, given the fact that Experiment 6A used the same set of images as Experiment 5A and failed to replicate any bias, this may support the interpretation that the specific task in combination with the image set could have had effects that were actually independent of a specific attentional bias to animals (e.g., low-level features being more salient in lateral views of animals than artifacts in general). While some of our results could potentially be viewed as false negatives or, at least, as not sensitive enough to reveal the supposed attentional bias, it seems that if a true effect of images of animals on our measures exists at all, it is likely to be small. Based on our omnibus approach, we failed to obtain robust evidence for appreciable effect sizes across measures for position accuracy, incorrect responses (distractibility), response order (response prioritization), as well as monitoring RTs and accuracy. We did, however, find indications of improved identity accuracy for animal targets, but this effect was difficult to consider as evidence for an attentional bias given the overall context of the results. In fact, our degree of evidence in support of the presence of an effect for animals is largely dependent on the inclusion of Experiment 5, thus it seems important that future studies assess the validity of this experimental design in studying attentional biases. In fact, we can reject small-to-medium effect sizes with equivalence bounds when disregarding Experiment 5 in an omnibus analysis. Similarly, we found support for rejecting effect sizes considerably smaller than what we expected to find for measures on location accuracy, incorrect responses, and response orders. In the end, it cannot be ruled out that if there is a true effect, it might be smaller than our equivalence bounds and what our study was powered for. Hence, studies with a considerably higher level of statistical power should be conducted in future investigations, assuming that the potential for such small effects is deemed interesting and worthwhile. We need to point out that we have attempted to tax and challenge the attentional system responsible for keeping track of objects in various ways to bring about errors such that a category-specific prioritization may become detectable. However, it seems that a robust advantage or distracting effect of animals is not easily obtained in tasks requiring divided sustained attention. Rather, it seems that special conditions may need to be met to detect a sizable advantage, which raises the question of the validity and aptness of those conditions in revealing an attentional bias. In fact, the observed effect sizes across experiments were considerably smaller than what the authors of the original hypothesis designed for in their seminal study (New et al., 2007). Clearly, we cannot entirely rule out the presence of a small effect of animal targets. One should, however, consider the extent in which such a weak effect has practical implications for survival as well as the validity of the measures used to obtain such an effect. In particular, it is difficult to envision that lateral orientation changes should be especially sensitive to an attentional bias for animals in relation to the other null results and the original formulation of the hypothesis. In fact, it would seem necessary to reformulate the hypothesis, in order to provide a reason for why the attentional bias should be more sensitive to changes in lateral orientation of animals, without significantly improving tracking performance, or biasing response orders, or allowing them to act as more effective distractors than artifacts. For these reasons, it seems to us difficult to argue that an evolved animal monitoring circuit should be especially sensitive to changes in lateral orientation instead of the size (or proximity) of an animal. In fact, it does not seem particularly adaptive for a putative monitoring system to be specifically sensitive to some features that do not explicitly appear as more relevant for survival than others. Moreover, we cannot assume that performance in the MEM task with lateral orientation changes can be seen as exclusively grounded in attentional ability or attentional processes. There is the possibility that an advantage in detecting the type of changes made to animals over artifacts is based on cues provided by low-level features that were present in the sets of stimuli and not due to their category per se. However, we cannot rule out that sensitivity to specific low-level features could have been selected itself by natural selection to assist the detection of a specific category of objects (e.g., eye-like shapes. suggesting different directions of movement). In addition, it seems highly relevant to judge the present evidence in the light of other experiments and experimental paradigms that have examined the prioritization of animal stimuli in attention. In particular, in our laboratory, we have previously used tasks such as change detection (Hagen & Laeng, 2016) and attentional blink (Hagen & Laeng, 2017), which have both led us to question the animate monitoring hypothesis in relation to prioritization of attentional mechanisms. Other researchers have employed inattentional blindness tasks, which apparently showed an advantage for animals (Calvillo & Hawkins, 2016; Calvillo & Jackson, 2014). However, our previous study with images of animals in an attentional blink task showed that animals have no considerable impact on attentional blinks (Hagen & Laeng, 2017), but that they are reported more successfully regardless of their temporal position, suggesting instead an advantage in perceptual processing or encoding. That is, animal stimuli are unable to surpass the blindness of the attentional blink or spontaneously induce such blinks, while other stimuli considered biologically important (e.g., arousing words, facial expressions, or food) do seem to be able to do this. We note that another line of work (Pratt et al., 2010) has investigated the ability of arbitrarily shaped objects (squares) to capture attention when they abruptly changed motion patterns from predictable trajectories to unpredictable animate trajectories. The changes were coupled with a type of change detection task, where participants reported a vanishing square shortly after the change in motion pattern occurred. The experiments quite successfully demonstrated that such animate motion patterns are able to capture attention. An interesting question in relation to this study is whether such changes in motion patterns would be more readily detected if the objects were depicted as animals as opposed to artifacts. It is also worth noting that this study used random motion trajectories, which according to Pratt et al. (2010), should signal animate motion, as animates rarely move in predictable or lawful ways (they are self-propelled rather than moving under Newtonian physics). While our designs are markedly different from that of Pratt et al. (2010), one may raise the concern that the random motion patterns employed in the present experiments could have signaled animacy for the animate monitoring system. Hence, participants could have assigned equal priority to all objects, irrespective of what they depicted, effectively erasing any potential for a bias for images of animals. To attempt to resolve this concern, we have included Supplementary Experiment 3, which found no discernible evidence for a bias toward objects moving with random and unpredictable directional changes as compared with objects moving predictably. Importantly, it did not show that images of animals were biased when moving with predictable motions. In fact, the evidence was markedly incompatible with the aforementioned concern. Another potential concern with the design of the present experiments is that the system responsible for assigning attentional biases to animals could have been overtaxed by the number of animals displayed simultaneously. However, considering that research employing a large number of facial stimuli in similar tasks have repeatedly been successful in showing attentional biases (Jin & Xu, 2015; Li et al., 2016, 2017; Liu & Chen, 2012), this does not appear as a significant concern, and even less so regarding our last two experiments. Future studies could, however, attempt, with appropriate balancing, trials with just one animal among multiple nonanimals to discern whether it will be prioritized more than a single nonanimal or multiple animals. Finally, we would also like to note that, in contrast to our individual controlled experiments, the actual natural environment poses a complex and varied set of challenges. The present experiments represent only a modest probe in comparison to a noisy environment where animal monitoring may be relevant under different circumstances. Thus, researchers should pose the question of whether attentional biases for animals are even measurable in laboratory settings. Perhaps using a computer monitor showing images of animals cannot bring about robust evidence for an attentional bias, as these are confined to the use of symbolic representations, not the natural stimuli themselves. It is possible that our perceptual and attentional systems are perfectly able to make a clear distinction between image-like representations or depictions and the real objects (e.g., a deer vs. a car entering your visual field). Perhaps only the natural stimuli are fully able to engage perceptual or cognitive mechanisms in a manner consistent with the theory. However, we deem this possibility unlikely, as most of the evidence in favor of evolutionary processes on perception (e.g., properties of faces) and on attention (e.g., pop-out features) have been based on laboratory demonstrations with flattened, often abstract, and symbolic depictive items. Nevertheless, our point is that drawing conclusions from controlled settings to naturalistic behavior should be done with considerable caution and research in more naturalistic settings should be considered for future research, before attempting to conclude on how humans behave in the natural world when encountering other animals.

Conclusion

We have studied the role of animals in attention by challenging over 600 participants with several variations of visual tracking tasks, all requiring divided and sustained attention. Following the reasoning behind the animate monitoring hypothesis, we expected to find that associating positions with images of animals would lead to more accurate tracking, more vigilant monitoring, prioritized responses, and that animals would function as more effective distractors than artifacts. The combined results are, however, not strongly or unequivocally supportive of these expectations. Although some observations were in favor of the animate monitoring hypothesis, these could not be regarded as more than weak and inconclusive evidence. Indeed, the indicated effect sizes across experiments were considerably smaller than what we expected to find from current theory and previous research. We found moderate to strong evidence that images of animals do not improve positional tracking, do not act as more effective distractors, are not selected prior to artifacts in the response phase, and are not easier to monitor for changes in size. Click here for additional data file. Supplemental material for Chasing Animals With Split Attention: Are Animals Prioritized in Visual Tracking? by Thomas Hagen, Thomas Espeseth and Bruno Laeng in i-Perception

52 in total

1. Position tracking and identity tracking are separate systems: Evidence from eye movements.

Authors: Lauri Oksama; Jukka Hyönä
Journal: Cognition Date: 2015-11-09

2. Tracking unique objects.

Authors: Todd S Horowitz; Sarah B Klieger; David E Fencsik; Kevin K Yang; George A Alvarez; Jeremy M Wolfe
Journal: Percept Psychophys Date: 2007-02

3. Animate and inanimate objects in human visual cortex: Evidence for task-independent category effects.

Authors: Alison J Wiggett; Iwan C Pritchard; Paul E Downing
Journal: Neuropsychologia Date: 2009-07-23 Impact factor: 3.139

4. The animacy continuum in the human ventral vision pathway.

Authors: Long Sha; James V Haxby; Herve Abdi; J Swaroop Guntupalli; Nikolaas N Oosterhof; Yaroslav O Halchenko; Andrew C Connolly
Journal: J Cogn Neurosci Date: 2014-09-30 Impact factor: 3.225

5. Living and non-living categories. Is there a "normal" asymmetry?

Authors: E Capitani; M Laiacona; R Barbarotto; C Trivelli
Journal: Neuropsychologia Date: 1994-12 Impact factor: 3.139

Review 6. Concepts and categories: a cognitive neuropsychological perspective.

Authors: Bradford Z Mahon; Alfonso Caramazza
Journal: Annu Rev Psychol Date: 2009 Impact factor: 24.137

7. Category-specific attention for animals reflects ancestral priorities, not expertise.

Authors: Joshua New; Leda Cosmides; John Tooby
Journal: Proc Natl Acad Sci U S A Date: 2007-10-01 Impact factor: 11.205

8. Rethinking the basic-applied dichotomy.

Authors: Jeremy M Wolfe
Journal: Cogn Res Princ Implic Date: 2016-09-22

Review 9. Calculating and reporting effect sizes to facilitate cumulative science: a practical primer for t-tests and ANOVAs.

Authors: Daniël Lakens
Journal: Front Psychol Date: 2013-11-26

10. Cortical Circuit for Binding Object Identity and Location During Multiple-Object Tracking.

Authors: Lauri Nummenmaa; Lauri Oksama; Erico Glerean; Jukka Hyönä
Journal: Cereb Cortex Date: 2017-01-01 Impact factor: 5.357

2 in total

Review 1. Multiple-target tracking in human and machine vision.

Authors: Shiva Kamkar; Fatemeh Ghezloo; Hamid Abrishami Moghaddam; Ali Borji; Reza Lashgari
Journal: PLoS Comput Biol Date: 2020-04-09 Impact factor: 4.475

2. Animate and Inanimate Words Demonstrate Equivalent Retrieval Dynamics Despite the Occurrence of the Animacy Advantage.

Authors: Michael J Serra
Journal: Front Psychol Date: 2021-06-03

2 in total