| Literature DB >> 27857160 |
Artem Platonov1, Guy A Orban1.
Abstract
Little is presently known about action observation, an important perceptual component of high-level vision. To investigate this aspect of perception, we introduce a two-alternative forced-choice task for observed manipulative actions while varying duration or signal strength by noise injection. We show that accuracy and reaction time in this task can be modeled by a diffusion process for different pairs of action exemplars. Furthermore, discrimination of observed actions is largely viewpoint-independent, cannot be reduced to judgments about the basic components of action: shape and local motion, and requires a minimum duration of about 150-200 ms. These results confirm that action observation is a distinct high-level aspect of visual perception based on temporal integration of visual input generated by moving body parts. This temporal integration distinguishes it from object or scene perception, which require only very brief presentations and are viewpoint-dependent. The applicability of a diffusion model suggests that these aspects of high-level vision differ mainly at the level of the sensory neurons feeding the decision processes.Entities:
Mesh:
Year: 2016 PMID: 27857160 PMCID: PMC5114682 DOI: 10.1038/srep36742
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Overview of experiments 1–6.
| Experiment | Tested actions | PD | SL | Subjects |
|---|---|---|---|---|
| 1 | action pair 1 | 2o | 0, 10, 20, 30, 50 and 70% | S1–S5 |
| 13o | 0, 6.25, 12.5, 20, 25 and 50% | S6–S9 | ||
| 2 | action pair 2 | 13o | 0, 6.25, 12.5, 20, 25 and 50% | S2, S3, S6 and S9 |
| 3 | action pair 1 | 13o | 0, 6.25, 12.5, 20, 25 and 50% | S10-S21 |
| 4 | action pair 1 | 2o | 0, 10, 20, 30, 50 and 70% | S1–S3 and S22 |
| 5 | action pair 1 and 2 | 13o | 0, 6.25, 12.5, 25 and 50% | S2, S3, S9 and S22 |
| 6 | action pair 2 | — | 100% | S23–S32 |
Figure 1Experiment 1.
Response time (circles, upper rows) and accuracy (triangles, lower rows) plotted as a function of signal strength for 2AFC discrimination of rotating-rolling by subjects S1–S9. The proportional-rate diffusion model provided a close fit (solid lines) to the data in all subjects. Dashed lines indicate halfway response time and 75% accuracy thresholds (see Table 2). Error bars indicate ±1 SEM.
Parameter values calculated for proportional-rate diffusion model, fitting the results of experiments 1 and 2 (A’ = normalized bound; k = sensitivity; tR = mean residual time in s), threshold ratio, estimated 75% accuracy threshold, and quality of fit (L = likelihood).
| Subject | Threshold ratio | Threshold (75%) | ln(L) | ||||
|---|---|---|---|---|---|---|---|
| Exp 1 | S1 | 1.53 | 28.7 | 0.30 | 3.50 | 12.5 | 23.6 |
| S2 | 1.45 | 27.1 | 0.33 | 3.49 | 14.0 | 15.8 | |
| S3 | 1.22 | 37.8 | 1.14 | 3.50 | 11.9 | 12.1 | |
| S4 | 1.26 | 31.6 | 0.76 | 3.49 | 13.8 | 15.4 | |
| S5 | 0.78 | 49.1 | 1.06 | 3.49 | 14.4 | 14.8 | |
| S6 | 1.38 | 16.9 | 0.13 | 3.49 | 23.7 | 5.71 | |
| S7 | 0.93 | 36.0 | 0.76 | 3.48 | 16.5 | 9.96 | |
| S8 | 1.51 | 19.5 | 0.04 | 3.49 | 18.7 | 5.89 | |
| S9 | 1.25 | 21.1 | 0.43 | 3.49 | 20.9 | 24.8 | |
| Mean (SD) | 1.11 (0.30) | 29.8 (10.2) | 0.55 (0.40) | 3.49 (0.01) | 16.3 (4.04) | ||
| Exp 2 | S2 | 1.04 | 45.2 | 1.50 | 3.48 | 11.7 | 10.2 |
| S3 | 1.31 | 40.2 | 1.23 | 3.50 | 10.4 | 32.5 | |
| S6 | 0.73 | 69.0 | 1.40 | 3.50 | 10.9 | 11.4 | |
| S9 | 1.38 | 60.7 | 1.11 | 3.47 | 6.6 | 14.1 | |
| Mean (SD) | 1.26 (0.24) | 53.8 (13.4) | 1.31 (0.17) | 3.49 (0.02) | 9.9 (2.26) |
Figure 2Experiment 2.
Response time (circles, upper rows) and accuracy (triangles, lower rows) plotted as a function of signal strength for discrimination of grasping-dragging by subjects S2, S3, S6, S9. To facilitate the comparison between the 2 action pairs, results from the same subjects obtained in experiment 1 are also plotted. Neither the halfway response times nor 75% accuracy threshold, calculated from the proportional-rate diffusion model fit (lines) differed significantly between two action pairs tested. Same conventions as Fig. 1.
Experiment 3.
| Subject | IT | MT | FT | VS | |
|---|---|---|---|---|---|
| Group 1 | S10 | 40.5 | 43.8 | 36.6 | 31.6 |
| S11 | 47.0 | 34.2 | 27.8 | 23.1 | |
| S12 | 42.0 | 34.3 | 36.3 | 34.3 | |
| S13 | 19.8 | 22.7 | 18.4 | 14.6 | |
| S14 | 34.6 | 39.8 | 24.5 | 21.1 | |
| S15 | 33.5 | 25.5 | 19.3 | 16.9 | |
| Mean ± SD | 36.2 ± 9.46 | 33.4 ± 8.10 | 27.2 ± 7.98 | 23.6 ± 7.89 | |
| Group 2 | S16 | 42.4 | 24.3 | 20.5 | 26.7 |
| S17 | 24.8 | 22.1 | 20.6 | 26.9 | |
| S18 | 18.8 | 28.3 | 18.8 | 23.7 | |
| S19 | 21.6 | 27.5 | 20.9 | 22.5 | |
| S20 | 18.3 | 30.1 | 18.2 | 22.7 | |
| S21 | 24.2 | 23.7 | 19.2 | 17.2 | |
| Mean ± SD | 25.0 ± 8.93 | 26 ± 3.09 | 19.7 ± 1.11 | 23.3 ± 3.55 |
Accuracy thresholds in the initial training session (IT), middle training session (MT), final training session (FT) and after a viewpoint switch (VS) for group 1 & 2 subjects.
Two-way ANOVA of IT, MT and FT thresholds: Main effect Session (IT, MT, FT): (F2, 35 = 3.57, p < 0.05); Main effect: Group (1, 2). (F1, 35 = 13.7, p < 0.01); Interaction. (F2, 35 = 0.28, p > 0.75).
Experiment 3.
| Subject | MT | FT | VS | |
|---|---|---|---|---|
| Group 1 | S10 | 152.7 | 127.7 | 110.2 |
| S11 | 119.3 | 96.8 | 80.5 | |
| S12 | 119.6 | 126.7 | 119.5 | |
| S13 | 79.3 | 64.1 | 50.9 | |
| S14 | 138.7 | 85.3 | 73.7 | |
| S15 | 89.1 | 67.1 | 58.9 | |
| Mean ± SD | 116.5 ± 28.1 | 94.6 ± 27.9 | 82.3 ± 27.5 | |
| Group 2 | S16 | 84.8 | 71.6 | 93.1 |
| S17 | 77.1 | 71.7 | 93.7 | |
| S18 | 98.6 | 65.5 | 82.7 | |
| S19 | 95.8 | 72.9 | 78.3 | |
| S20 | 104.9 | 63.6 | 79.3 | |
| S21 | 82.7 | 66.8 | 60.0 | |
| Mean ± SD | 90.7 ± 10.7 | 68.7 ± 3.87 | 81.2 ± 12.3 |
Half-way response time thresholds in middle training session (MT), final training session (FT) and after a viewpoint switch (VS) for group 1 & 2 subjects.
Figure 3Experiment 4.
Accuracy in the manipulative action discrimination task for subjects S1–S3 and S22 in the action, static and dynamic conditions. Positive and negative signal values represent rotation and rolling actions, respectively. A logistic regression fit to the data is superimposed.
Figure 4Experiment 5.
Accuracy in discriminating between rotation and rolling (S2 and S22) and grasping and dragging (S3 and S9) plotted for the action and static stimuli depicting beginning, middle and end time points. Positive and negative signal values represent rotation and rolling, and dragging and grasping actions, respectively. A logistic regression fit to the data is superimposed. 75% accuracy thresholds (in SL) are indicated by numbers in italic at the top left of the plots.
Figure 5Experiment 6.
Average (n = 10) % dragging responses as a function of duration of the observed action: positive and negative durations represent observed dragging and grasping respectively. The logistic regression fit to the data is again superimposed. Vertical bars indicate SEM. 75% and 84% thresholds (dashed lines) averaged 138 ms and 216 ms respectively.