Gang Liu1, Jukka-Pekka Onnela1. 1. Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA.
Abstract
OBJECTIVE: We propose a bidirectional GPS imputation method that can recover real-world mobility trajectories even when a substantial proportion of the data are missing. The time complexity of our online method is linear in the sample size, and it provides accurate estimates on daily or hourly summary statistics such as time spent at home and distance traveled. MATERIALS AND METHODS: To preserve a smartphone's battery, GPS may be sampled only for a small portion of time, frequently <10%, which leads to a substantial missing data problem. We developed an algorithm that simulates an individual's trajectory based on observed GPS location traces using sparse online Gaussian Process to addresses the high computational complexity of the existing method. The method also retains the spherical geometry of the problem, and imputes the missing trajectory in a bidirectional fashion with multiple condition checks to improve accuracy. RESULTS: We demonstrated that (1) the imputed trajectories mimic the real-world trajectories, (2) the confidence intervals of summary statistics cover the ground truth in most cases, and (3) our algorithm is much faster than existing methods if we have more than 3 months of observations; (4) we also provide guidelines on optimal sampling strategies. CONCLUSIONS: Our approach outperformed existing methods and was significantly faster. It can be used in settings in which data need to be analyzed and acted on continuously, for example, to detect behavioral anomalies that might affect treatment adherence, or to learn about colocations of individuals during an epidemic.
OBJECTIVE: We propose a bidirectional GPS imputation method that can recover real-world mobility trajectories even when a substantial proportion of the data are missing. The time complexity of our online method is linear in the sample size, and it provides accurate estimates on daily or hourly summary statistics such as time spent at home and distance traveled. MATERIALS AND METHODS: To preserve a smartphone's battery, GPS may be sampled only for a small portion of time, frequently <10%, which leads to a substantial missing data problem. We developed an algorithm that simulates an individual's trajectory based on observed GPS location traces using sparse online Gaussian Process to addresses the high computational complexity of the existing method. The method also retains the spherical geometry of the problem, and imputes the missing trajectory in a bidirectional fashion with multiple condition checks to improve accuracy. RESULTS: We demonstrated that (1) the imputed trajectories mimic the real-world trajectories, (2) the confidence intervals of summary statistics cover the ground truth in most cases, and (3) our algorithm is much faster than existing methods if we have more than 3 months of observations; (4) we also provide guidelines on optimal sampling strategies. CONCLUSIONS: Our approach outperformed existing methods and was significantly faster. It can be used in settings in which data need to be analyzed and acted on continuously, for example, to detect behavioral anomalies that might affect treatment adherence, or to learn about colocations of individuals during an epidemic.
Authors: Ian Barnett; John Torous; Patrick Staples; Luis Sandoval; Matcheri Keshavan; Jukka-Pekka Onnela Journal: Neuropsychopharmacology Date: 2018-02-22 Impact factor: 7.853
Authors: Lin Yang; Chao Cao; Elizabeth D Kantor; Long H Nguyen; Xiaobin Zheng; Yikyung Park; Edward L Giovannucci; Charles E Matthews; Graham A Colditz; Yin Cao Journal: JAMA Date: 2019-04-23 Impact factor: 56.272
Authors: Nikhil Panda; Ian Solsky; Becky Hawrusik; Gang Liu; Harrison Reeder; Stuart Lipsitz; Eesha V Desai; Kurt W Lowery; Kate Miller; Michele A Gadd; Carrie C Lubitz; Barbara L Smith; Michelle Specht; Jukka-Pekka Onnela; Alex B Haynes Journal: Ann Surg Oncol Date: 2020-08-18 Impact factor: 5.344
Authors: James D Berry; Sabrina Paganoni; Kenzie Carlson; Katherine Burke; Harli Weber; Patrick Staples; Joel Salinas; James Chan; Jordan R Green; Kathryn Connaghan; Josh Barback; Jukka Pekka Onnela Journal: Ann Clin Transl Neurol Date: 2019-04-03 Impact factor: 4.511