| Literature DB >> 36103558 |
Christopher DiMattina1, Josiah J Burnham2, Betul N Guner3, Haley B Yerxa2.
Abstract
In order to accurately parse the visual scene into distinct surfaces, it is essential to determine whether a local luminance edge is caused by a boundary between two surfaces or a shadow cast across a single surface. Previous studies have demonstrated that local chromatic cues may help to distinguish edges caused by shadows from those caused by surface boundaries, but the information potentially available in local achromatic cues like contrast, texture, and penumbral blur remains poorly understood. In this study, we develop and analyze a large database of hand-labeled achromatic shadow edges to better understand what image properties distinguish them from occlusion edges. We find that both the highest contrast as well as the lowest contrast edges are more likely to be occlusions than shadows, extending previous observations based on a more limited image set. We also find that contrast cues alone can reliably distinguish the two edge categories with nearly 70% accuracy at 40x40 resolution. Logistic regression on a Gabor Filter bank (GFB) modeling a population of V1 simple cells separates the categories with nearly 80% accuracy, and furthermore exhibits tuning to penumbral blur. A Filter-Rectify Filter (FRF) style neural network extending the GFB model performed at better than 80% accuracy, and exhibited blur tuning and greater sensitivity to texture differences. We compare human performance on our edge classification task to that of the FRF and GFB models, finding the best human observers attaining the same performance as the machine classifiers. Several analyses demonstrate both classifiers exhibit significant positive correlation with human behavior, although we find a slightly better agreement on an image-by-image basis between human performance and the FRF model than the GFB model, suggesting an important role for texture.Entities:
Mesh:
Year: 2022 PMID: 36103558 PMCID: PMC9512248 DOI: 10.1371/journal.pcbi.1010473
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.779
AO confusion matrix for all LB experiments.
| LB-1 | LB-2 | LB-3 | ||||
|---|---|---|---|---|---|---|
| predicted | predicted | predicted | ||||
|
|
|
|
|
|
|
|
|
| 69 | 31 | 73 | 27 | 83 | 17 |
|
| 27 | 73 | 22 | 78 | 19 | 81 |
Spearman correlation between the log-odds ratio obtained from the aggregate observer (AO) and the machine classifiers over the N = 200 images in each survey.
All values are significant with p < 0.001.
| survey | GFB | FRF |
|---|---|---|
| QT-1 | 0.604 | 0.696 |
| LB-1 | 0.475 | 0.545 |
| QT-2 | 0.568 | 0.587 |
| LB-2 | 0.610 | 0.627 |
| QT-3 | 0.697 | 0.775 |
| LB-3 | 0.743 | 0.767 |
AO confusion matrix for all QT experiments.
| QT-1 | QT-2 | QT-3 | ||||
|---|---|---|---|---|---|---|
| predicted | predicted | predicted | ||||
|
|
|
|
|
|
|
|
|
| 88 | 12 | 74 | 26 | 88 | 12 |
|
| 19 | 81 | 21 | 79 | 24 | 76 |