| Literature DB >> 24009746 |
Vince D Calhoun1, Vamsi K Potluru, Ronald Phlypo, Rogers F Silva, Barak A Pearlmutter, Arvind Caprihan, Sergey M Plis, Tülay Adalı.
Abstract
A recent paper by Daubechies et al. claims that two independent component analysis (ICA) algorithms, Infomax and FastICA, which are widely used for functional magnetic resonance imaging (fMRI) analysis, select for sparsity rather than independence. The argument was supported by a series of experiments on synthetic data. We show that these experiments fall short of proving this claim and that the ICA algorithms are indeed doing what they are designed to do: identify maximally independent sources.Entities:
Mesh:
Year: 2013 PMID: 24009746 PMCID: PMC3757003 DOI: 10.1371/journal.pone.0073309
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The excess kurtosis of a source as a function of the relative size of the active region.
A Gaussian has zero excess kurtosis. Here as in Example 2 of the original paper [8]. The four vertical lines at correspond to the relative sizes of the small box, the medium box, the large box, and a very large box corresponding to the maximal kurtosis case. Note that the medium and large box experiments have near zero excess kurtosis, i.e., kurtosis value matching that of a Gaussian. In addition, the pdfs of these sources are bimodal (see inset figures), ensuring that ICA algorithms designed for unimodal super-Gaussian distributions such as Infomax and FastICA with standard parameter settings, will likely fail. At the bottom of the figure are the ISI values (see Equation (2)) for the various algorithms at those four points (see Table 1 for full list). Also note the best separation performance of Infomax and FastICA for the maximum kurtosis case, which corresponds to almost the lowest level of sparsity.
Source estimates for the four cases indicated in Figure 1.
| Box Size & Properties | Results | (good is ISI <0.1) | ||
|
|
|
|
| |
| Unimodal, super-Gaussian sources |
|
|
| |
|
| 0.8829 |
|
|
|
|
| 0.8107 |
| 1.0493±0.0015 | 0.9499±0.0004 |
| Mutual Information Between Sources c1 & c2: | 0.0920 |
|
|
|
|
| ||||
|
|
|
|
| |
| Bimodal, close-to-Gaussian sources |
| 0.2068±0.0662 | 0.1464±0.0513 | |
|
| 0.2564 |
| 0.8722±0.0651 | 0.7434±0.0600 |
|
| 0.0879 |
| 0.1597±0.0058 | 0.1144±0.0041 |
| Mutual Information Between Sources c1 & c2: | 0.0929 |
|
|
|
|
| ||||
|
|
|
|
| |
| Bimodal, close-to-Gaussian sources |
| 0.4081±0.1003 | 0.3102±0.0823 | |
|
| 0.0010 |
| 1.0297±0.0009 | 0.9236±0.0005 |
|
| 0.0762 |
|
|
|
| Mutual Information Between Sources c1 & c2: | 0.0892 |
|
|
|
|
| ||||
|
|
|
|
| |
| Unimodal, super- Gaussian sources. |
|
|
| |
|
| 5.6432 |
|
|
|
|
| 5.6394 |
| 1.0711±0.0014 | 0.9762±0.0009 |
| Mutual Information Between Sources c1 & c2: | 0.0686 |
|
|
|
|
| ||||
as in Example 2 of the original paper [8]. The algorithms behave as one would expect if they are selecting for independence. For the bimodal/Gaussian-like cases, ICA-EBM and Infomax (sub) do well, and for the unimodal/maximum kurtosis/low sparsity case Infomax-super, FastICA and ICA-EBM all do extremely well. Numbers in boldface indicate when separation was good.
Figure 2The distribution of sources and mixtures for ().
We plot (A–C) the distribution of sources, and (D) the contour plot of mixtures for the case of (). Contrary to the claim made in Daubechies et al., the sources have in fact very peaky and heavy-tailed distributions and are not at all close to a Gaussian distribution. For comparison purposes we also present Gaussian distribution curves (blue, A–B).
Tabulated results for the so-called [8] ICA “promotional material.”
| Observed Properties and Results (good is ISI <0.1) | ||
| Property | Source a (sa) | Source b (sb) |
| Negentropy: | 0.2753 | 0.3708 |
| (excess) Kurtosis: | 3.0630 | 3.5225 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1.0758 | 0.9899 |
|
|
|
|
Both Infomax (super) and FastICA do successfully separate (zero ISI indicates perfect separation) the super-Gaussian sources and . Note the excess kurtosis is more than 3 for both sources. Numbers in boldface indicate when separation was good.
Figure 3Sparsity measures for three different coordinate system origins ().
Sparsity as measured with respect to different coordinate system origins (), as a function of the relative size of the active region. Remark that for a relative size of zero, only background samples are present and, thus, the mean of the mixture model coincides with the mean of the background (and the two sparsity measures correspond at this point). An analogous observation can be made for a relative size of one, now with respect to the activity (signal samples).