| Literature DB >> 28727743 |
James R H Cooke1, Arjan C Ter Horst1, Robert J van Beers1,2, W Pieter Medendorp1.
Abstract
Many daily situations require us to track multiple objects and people. This ability has traditionally been investigated in observers tracking objects in a plane. This simplification of reality does not address how observers track objects when targets move in three dimensions. Here, we study how observers track multiple objects in 2D and 3D while manipulating the average speed of the objects and the average distance between them. We show that performance declines as speed increases and distance decreases and that overall tracking accuracy is always higher in 3D than in 2D. The effects of distance and dimensionality interact to produce a more than additive improvement in performance during tracking in 3D compared to 2D. We propose an ideal observer model that uses the object dynamics and noisy observations to track the objects. This model provides a good fit to the data and explains the key findings of our experiment as originating from improved inference of object identity by adding the depth dimension.Entities:
Mesh:
Year: 2017 PMID: 28727743 PMCID: PMC5519009 DOI: 10.1371/journal.pcbi.1005554
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 1Schematic representation of multiple object tracking task.
(a) Multiple object tracking. Subjects tracked three indicated objects before responding whether or not a probed object was a target. (b) Example trajectories of four objects through virtual space. Dashed lines indicate object trajectories over 1.5s, disks indicate trajectory start and the grey plane indicates the screen. The blue and red lines represent part of the 2D and 3D trajectories, respectively. Trajectories were taken from trials with σ = 4° and σ = 0.2° per frame.
Experimental sessions with σ, σ, and depth conditions.
| Exp | Dim | ||
|---|---|---|---|
| 2 | 0.005, 0.02, 0.035, 0.05, 0.065, 0.08, 0.1, 0.12, 0.156, 0.2 | 2D | |
| 3 | 0.005, 0.0267, 0.0483, 0.07, 0.0917, 0.1133, 0.135, 0.1567, 0.1783, 0.2 | 2D | |
| 4 | 0.005, 0.0267, 0.0483, 0.07, 0.0917, 0.1133, 0.135, 0.1567, 0.1783, 0.2 | 2D | |
| 2 | 0.005, 0.02, 0.035, 0.05, 0.065, 0.08, 0.1, 0.12, 0.156, 0.2 | 3D | |
| 3 | 0.005, 0.0267, 0.0483, 0.07, 0.0917, 0.1133, 0.135, 0.1567, 0.1783, 0.2 | 3D | |
| 4 | 0.005, 0.0267, 0.0483, 0.07, 0.0917, 0.1133, 0.135, 0.1567, 0.1783, 0.2 | 3D |
Fig 2Schematic representation of ideal observer MOT model.
White circles indicate the model’s estimate of an object state; black circles indicate their noisy perceptual measurements (indicated by σ). Grey squares indicate the predictions of their future state. represents the process noise variance used for prediction. The initial model estimates are set to the objects’ start position. The model proceeds to track the objects for the duration of the trial. This is accomplished by predicting the future state using the process dynamics and combing this with noisy perceptual measurements to determine the measurement assignments. The assignments are used to estimate the objects’ state. The estimates are used to generate predictions and the process is repeated until the end of a trial. At the end of a trial the model is probed to test if objects were correctly tracked. This was done by drawing a random sample from the position of one object and calculating the probability this sample came from a target or non-target.
Fig 3Simplified model illustration.
(A) NE-cd model run on object trajectories for two objects, including confusion of objects. The red and blue dashed lines represent the actual trajectory for two different objects, the green line indicates the model’s position estimate of one object. The vertical black line indicates the time point of the data used in B. (B) Likelihood of a measurement coming from object 1 (blue) and object 2 (red) in the 2D case (left) and 3D case (right). Contour plots represent four slices of the two-dimensional likelihood function, evenly spread from the minimum to maximum likelihoods. Similar likelihoods in one dimension can be disambiguated in the other dimension. (C) Contour plots of predicted state and covariance given the posterior distribution of previous time step (black) for FE (magenta) and NE-cd (cyan) model, NE-cd does not use velocity information in the prediction, leading to biases towards zero velocity and position. Data was generated with σ = 2° and σ = 0.2° per frame using the best fit parameters from the NE-cd model (see Table 2).
Maximum Likelihood parameters and quality of fit for the four models.
| Model | c (m) | d (m) | Relative log likelihood |
|---|---|---|---|
| 0.020 | 0.0169 | -580.74 | |
| 0.0082 | 0.0202 | 0 | |
| 0.010 | 0.0250 | -32.80 | |
| 0.0014 | 0.0101 | -69.91 |
c is a free scaling parameter for position noise in the frontoparallel plane, d is a free scaling parameter for position noise in the depth plane.
Fig 4Accuracy data from tracking experiment.
Data and fitted psychometric curves for a single subject (left) and group data (right). Data points indicate percentage of correct responses. Error bars indicate 1 standard error calculated across subjects. Shaded areas indicate 1 standard error of psychometric curves across subjects. Dashed lines indicate σ value for 75% correct performance used for comparison across conditions.
Fig 5Distance and depth interaction.
Interaction between depth and distance for the σ value required for 75% correct performance in 2D and 3D conditions for the different models. Solid lines indicate 3D conditions, dashed lines the 2D conditions, black lines indicate data and the colors represent model predictions (Blue: FE, green: NE-cd, orange: NE-id and yellow: NV). Error bars indicate one standard error.
Fig 6Model predictions.
Data points indicate percentage of correct responses for this stimulus combination. FE is full extrapolation model, NE-cd is the no extrapolation with correct dynamics model, NE-id is the no extrapolation with incorrect dynamics model and NV is no velocity model. Blue and red lines indicate percentage correct predictions for 2D and 3D conditions, respectively. Error bars indicate one standard error.