Literature DB >> 32113005

FilterK: A new outlier detection method for k-means clustering of physical activity.

Petra J Jones1, Matthew K James2, Melanie J Davies3, Kamlesh Khunti4, Mike Catt5, Tom Yates6, Alex V Rowlands7, Evgeny M Mirkes8.   

Abstract

In this paper, a new algorithm denoted as FilterK is proposed for improving the purity of k-means derived physical activity clusters by reducing outlier influence. We applied it to physical activity data obtained with body-worn accelerometers and clustered using k-means. We compared its performance with three existing outlier detection methods: Local Outlier Factor, Isolation Forests and KNN using the ground truth (class labels), average cluster and event purity (ACEP). FilterK provided comparable gains in ACEP (0.581 → 0.596 compared to 0.580-0.617) whilst removing a lower number of outliers than the other methods (4% total dataset size vs 10% to achieve this ACEP). The main focus of our new outlier detection method is to improve the cluster purities of physical activity accelerometer data, but we also suggest it may be potentially applied to other types of dataset captured by k-means clustering. We demonstrate our method using a k-means model trained on two independent accelerometer datasets (training n = 90) and re-applied to an independent dataset (test n = 41). Labelled physical activities include lying down, sitting, standing, household chores, walking (laboratory and non-laboratory based), stairs and running. This type of clustering algorithm could be used to assist with identifying optimal physical activity patterns for health.
Copyright © 2020 Elsevier Inc. All rights reserved.

Keywords:  Accelerometer; Activity; Anomaly; Detection; Outlier; Physical

Mesh:

Year:  2020        PMID: 32113005     DOI: 10.1016/j.jbi.2020.103397

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  1 in total

1.  Evaluating the Impact of a Two-Stage Multivariate Data Cleansing Approach to Improve to the Performance of Machine Learning Classifiers: A Case Study in Human Activity Recognition.

Authors:  Dionicio Neira-Rodado; Chris Nugent; Ian Cleland; Javier Velasquez; Amelec Viloria
Journal:  Sensors (Basel)       Date:  2020-03-27       Impact factor: 3.576

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.