| Literature DB >> 21886618 |
Abstract
Incompatible images presented to the two eyes lead to perceptual oscillations in which one image at a time is visible. Early models portrayed this binocular rivalry as involving reciprocal inhibition between monocular representations of images, occurring at an early visual stage prior to binocular mixing. However, psychophysical experiments found conditions where rivalry could also occur at a higher, more abstract level of representation. In those cases, the rivalry was between image representations dissociated from eye-of-origin information, rather than between monocular representations from the two eyes. Moreover, neurophysiological recordings found the strongest rivalry correlate in inferotemporal cortex, a high-level, predominantly binocular visual area involved in object recognition, rather than early visual structures. An unresolved issue is how can the separate identities of the two images be maintained after binocular mixing in order for rivalry to be possible at higher levels? Here we demonstrate that after the two images are mixed, they can be unmixed at any subsequent stage using a physiologically plausible non-linear signal-processing algorithm, non-negative matrix factorization, previously proposed for parsing object parts during object recognition. The possibility that unmixed left and right images can be regenerated at late stages within the visual system provides a mechanism for creating various binocular representations and interactions de novo in different cortical areas for different purposes, rather than inheriting then from early areas. This is a clear example how non-linear algorithms can lead to highly non-intuitive behavior in neural information processing.Entities:
Keywords: binocular rivalry; blind source separation; independent component analysis; non-linear dynamical systems; non-negative matrix factorization
Year: 2011 PMID: 21886618 PMCID: PMC3152723 DOI: 10.3389/fnhum.2011.00078
Source DB: PubMed Journal: Front Hum Neurosci ISSN: 1662-5161 Impact factor: 3.169
Figure 1Schematic of image mixing and unmixing process.
Figure 2Mechanics of the unmixing algorithm. (Ai) Matrix representation of binocular mixing. The binocular mixture matrix B had five columns, representing the five mixed images depicted in Figure 1. Each column had 40,000 rows, corresponding to 40,000 pixels in each image (200 × 200 pixels). Thus each image is “unfolded” from a 2D array to a 1D column of pixels. The binocular matrix B was factored into two non-negative matrices M and A such that B = M × A. The factorization was done by iteratively updating M and A in accord with the NMF algorithm so as to gradually reduce error between B and M × A, with error based on entropy divergence (Lee and Seung, 1999, 2001). The matrix M had two columns, containing left and right source images, and 40,000 rows. The matrix A contained mixing coefficients, which combined the two source images in M to form different binocular mixtures. Matrix A had five columns and two rows, corresponding to five pairs of mixing coefficients to produce five different binocular mixtures. (Aii) Matrix representation of binocular unmixing. The matrix W of unmixing coefficients is the Moore–Penrose generalized inverse of the mixing matrix A. (B) Neural network interpretation of the unmixing algorithm. Diagram adapted from Cichocki et al. (2009).
Figure 3Unmixing results and errors, for example runs of the NMF and ICA algorithms. Source images are shown, together with images recovered after the source images were binocularly mixed and then unmixed. Error indicates pixel subtraction (original image)–(unmixed image). Gray levels in source and unmixed images fell in the range 0.0–1.0. (A) Results for NMF unmixing algorithm. (B) Results for ICA unmixing algorithm.