| Literature DB >> 24600388 |
Alexandre Abraham1, Fabian Pedregosa1, Michael Eickenberg1, Philippe Gervais1, Andreas Mueller2, Jean Kossaifi3, Alexandre Gramfort4, Bertrand Thirion1, Gaël Varoquaux1.
Abstract
Statistical machine learning methods are increasingly used for neuroimaging data analysis. Their main virtue is their ability to model high-dimensional datasets, e.g., multivariate analysis of activation images or resting-state time series. Supervised learning is typically used in decoding or encoding settings to relate brain images to behavioral or clinical observations, while unsupervised learning can uncover hidden structures in sets of images (e.g., resting state functional MRI) or find sub-populations in large cohorts. By considering different functional neuroimaging applications, we illustrate how scikit-learn, a Python machine learning library, can be used to perform some key analysis steps. Scikit-learn contains a very large set of statistical learning algorithms, both supervised and unsupervised, and its application to neuroimaging data provides a versatile tool to study the brain.Entities:
Keywords: Python; machine learning; neuroimaging; scikit-learn; statistical learning
Year: 2014 PMID: 24600388 PMCID: PMC3930868 DOI: 10.3389/fninf.2014.00014
Source DB: PubMed Journal: Front Neuroinform ISSN: 1662-5196 Impact factor: 4.081
Figure 1Conversion of brain scans into 2-dimensional data.
Figure 2Maps derived by different methods for face versus house recognition in the Haxby experiment—. The masks derived from standard analysis in the original paper (Haxby et al., 2001) are displayed in blue and green.
Five fold cross validation accuracy scores obtained for different values of parameter .
| ℓ1 Logistic regression | 0.50 ± 0.02 | 0.50 ± 0.02 | 0.57 ± 0.13 | 0.63 ± 0.11 | 0.70 ± 0.12 | |
| ℓ2 Logistic regression | 0.60 ± 0.11 | 0.61 ± 0.12 | 0.63 ± 0.13 | 0.63 ± 0.13 | 0.64 ± 0.13 | |
| ℓ1 SVM classifier (SVC) | 0.50 ± 0.06 | 0.55 ± 0.12 | 0.69 ± 0.11 | 0.69 ± 0.12 | 0.68 ± 0.12 | |
| ℓ2 SVM classifier (SVC) | 0.67 ± 0.12 | 0.67 ± 0.12 | 0.66 ± 0.12 | 0.65 ± 0.12 | 0.65 ± 0.12 |
Figure 3Miyawaki results in both decoding and encoding. Relations between one pixel and four brain voxels is highlighted for both methods. Top: Decoding. Classifier weights for the pixel highlighted [(A) Logistic regression, (C) SVM]. Reconstruction accuracy per pixel [(B) Logistic regression, (D) SVM]. Bottom: Encoding. (E): receptive fields corresponding to voxels with highest scores and its neighbors. (F): reconstruction accuracy depending on pixel position in the stimulus—note that the pixels and voxels highlighted are the same in both decoding and encoding figures and that encoding and decoding roughly match as both approach highlight a relationship between the same pixel and voxels.
Figure 4Default mode network extracted using different approaches: . Data have been normalized (set to unit variance) for display purposes.
Figure 5Brain parcellations extracted by clustering. Colors are random. (A) K-means, 100 clusters, (B) Ward, 100 clusters, (C) K-means, 1000 clusters, and (D) Ward, 1000 clusters.