| Literature DB >> 28846686 |
Nicolas Sauwen1,2, Marjan Acou3, Halandur N Bharath1,2, Diana M Sima1,2,4, Jelle Veraart5, Frederik Maes6, Uwe Himmelreich7, Eric Achten3, Sabine Van Huffel1,2.
Abstract
Non-negative matrix factorization (NMF) has become a widely used tool for additive parts-based analysis in a wide range of applications. As NMF is a non-convex problem, the quality of the solution will depend on the initialization of the factor matrices. In this study, the successive projection algorithm (SPA) is proposed as an initialization method for NMF. SPA builds on convex geometry and allocates endmembers based on successive orthogonal subspace projections of the input data. SPA is a fast and reproducible method, and it aligns well with the assumptions made in near-separable NMF analyses. SPA was applied to multi-parametric magnetic resonance imaging (MRI) datasets for brain tumor segmentation using different NMF algorithms. Comparison with common initialization methods shows that SPA achieves similar segmentation quality and it is competitive in terms of convergence rate. Whereas SPA was previously applied as a direct endmember extraction tool, we have shown improved segmentation results when using SPA as an initialization method, as it allows further enhancement of the sources during the NMF iterative procedure.Entities:
Mesh:
Year: 2017 PMID: 28846686 PMCID: PMC5573288 DOI: 10.1371/journal.pone.0180268
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Coregistered set of MRI images.
Showing T1+C(A), CBV(B), ADC(C) and Lac(D). The MRSI region of interest is marked in green. Segmentation results are shown below for active tumor and necrosis, respectively, for hNMF with each type of initialization: SPA (E,F), NNDSVD (G,H), FCM (I,J) and rRandom (K,L). NMF segmentation is shown in blue, segmentation by the radiologist in green and overlap in cyan.
Comparison of the mean Dice-scores and their standard deviation between different initialization methods for the UZ Ghent dataset.
The highest Dice-score per NMF method and per tissue class is marked in bold. * indicates statistically significantly higher Dice-scores with SPA initialization compared to direct SPA endmember extraction (right column), using a one-tailed Wilcoxon signed rank test (p < 0.05).
| aHALS | GD | PG | Convex | hNMF | SPA | ||
|---|---|---|---|---|---|---|---|
| 65 ± 13 | 69 ± 15* | 64 ± 20 | |||||
| 63 ± 18 | 62 ± 18 | 64 ± 14 | 63 ± 19 | 67 ± 17 | - | ||
| 60 ± 21 | 65 ± 14 | 59 ± 23 | 69 ± 15 | - | |||
| 64 ± 19 | 63 ± 19 | 63 ± 20 | - | ||||
| 73 ± 13 | |||||||
| 74 ± 12 | 74 ± 11 | 75 ± 12 | 76 ± 12 | - | |||
| 72 ± 14 | 75 ± 11 | 73 ± 11 | 77 ± 13 | - | |||
| 73 ± 15 | - | ||||||
| 80 ± 11* | 77 ± 13 | ||||||
| 80 ± 12 | 78 ± 12 | 82 ± 10 | 84 ± 9 | - | |||
| 77 ± 14 | 79 ± 13 | 82 ± 10 | 84 ± 10 | - | |||
| 79 ± 13 | 79 ± 12 | 82 ± 13 | 85 ± 8 | - | |||
Comparison of the mean Dice-scores and their standard deviation between different initialization methods for the UZ Leuven dataset.
The highest Dice-score per NMF method and per tissue class is marked in bold. * indicates statistically significantly higher Dice-scores with SPA initialization compared to direct SPA endmember extraction (right column), using a one-tailed Wilcoxon signed rank test (p < 0.05).
| aHALS | GD | PG | Convex | hNMF | SPA | ||
|---|---|---|---|---|---|---|---|
| 74 ± 15* | 67 ± 26 | ||||||
| 68 ± 29 | 70 ± 21 | 69 ± 23 | - | ||||
| 71 ± 21 | 69 ± 22 | 69 ± 23 | 74 ± 15 | - | |||
| 71 ± 22 | 68 ± 29 | 71 ± 22 | 74 ± 15 | - | |||
| 84 ± 8* | 83 ± 10* | 83 ± 11* | 79 ± 13 | ||||
| 84 ± 8 | 80 ± 14 | 84 ± 8 | 81 ± 11 | 84 ± 9 | - | ||
| 84 ± 8 | 84 ± 9 | - | |||||
| 84 ± 7 | - | ||||||
| 82 ± 10 | 80 ± 11 | ||||||
| 81 ± 15 | 83 ± 8 | - | |||||
| 83 ± 8 | 83 ± 8 | - | |||||
| 81 ± 10 | 83 ± 9 | 81 ± 10 | - | ||||
Mean number of iterations to reach convergence for the different initialization methods on the UZ Ghent dataset.
Convergence tolerance was set to 10−5 and the maximum number of iterations to 10000.
| #Iterations | ||||
|---|---|---|---|---|
| SPA | NNDSVD | FCM | rRandom | |
| 179 | 181 | 140 | 190 | |
| 218 | 175 | 832 | 418 | |
| 84 | 80 | 85 | 109 | |
| 8923 | 8796 | 8819 | 8810 | |
Fig 2Convergence plots for aHALS NMF (A), GD NMF (B), Convex NMF (C), and PG NMF (D) with the different initialization methods.
The residual error, ∥X − WH∥, is shown on a log scale. For rRandom and FCM, the shown curve corresponds to the selected run with the lowest residual error.