| Literature DB >> 32217284 |
Jung Hun Oh1, Maryam Pouryahya2, Aditi Iyer2, Aditya P Apte2, Joseph O Deasy2, Allen Tannenbaum3.
Abstract
The Wasserstein distance is a powerful metric based on the theory of optimal mass transport. It gives a natural measure of the distance between two distributions with a wide range of applications. In contrast to a number of the common divergences on distributions such as Kullback-Leibler or Jensen-Shannon, it is (weakly) continuous, and thus ideal for analyzing corrupted and noisy data. Until recently, however, no kernel methods for dealing with nonlinear data have been proposed via the Wasserstein distance. In this work, we develop a novel method to compute the L2-Wasserstein distance in reproducing kernel Hilbert spaces (RKHS) called kernel L2-Wasserstein distance, which is implemented using the kernel trick. The latter is a general method in machine learning employed to handle data in a nonlinear manner. We evaluate the proposed approach in identifying computed tomography (CT) slices with dental artifacts in head and neck cancer, performing unsupervised hierarchical clustering on the resulting Wasserstein distance matrix that is computed on imaging texture features extracted from each CT slice. We further compare the performance of kernel Wasserstein distance with alternatives including kernel Kullback-Leibler divergence we previously developed. Our experiments show that the kernel approach outperforms classical non-kernel approaches in identifying CT slices with artifacts.Entities:
Keywords: Kernel Kullback–Leibler divergence; Kernel Wasserstein distance; Kernel trick; Reproducing kernel Hilbert space
Mesh:
Year: 2020 PMID: 32217284 PMCID: PMC7237301 DOI: 10.1016/j.compbiomed.2020.103731
Source DB: PubMed Journal: Comput Biol Med ISSN: 0010-4825 Impact factor: 4.589