Jos Twisk1, Trynke Hoekstra. 1. Department of Methodology and Applied Biostatistics, Institute of Health Sciences, Faculty of Earth and Life Science, Vrije Universiteit, De Boelelaan 1087, 1081 HV Amsterdam, The Netherlands. jos.twisk@falw.vu.nl
Abstract
OBJECTIVE: In the analysis of data from longitudinal cohort studies, there is a growing interest in the analysis of developmental trajectories in subpopulations of the cohort under study. There are different advanced statistical methods available to analyze these trajectories, but in the epidemiologic literature, most of those are never used. The purpose of the present study is to compare five statistical methods to detect developmental trajectories in a longitudinal epidemiological data set. STUDY DESIGN AND SETTING: All five statistical methods (K-means clustering, a "two-step" approach with mixed modeling and K-means clustering, latent class analysis [LCA], latent class growth analysis [LCGA], and latent class growth mixture modeling [LCGMM]) were performed on a real-life data set and two manipulated data sets. The first manipulated data set contained four different linear developments over time, whereas the second contained two linear and two quadratic developments. RESULTS: For the real-life data set, all five classification methods revealed comparable trajectories. Regarding the manipulated data sets, LCGA performed best in detecting linear trajectories, whereas none of the methods performed well in detecting a combination of linear and quadratic trajectories. Furthermore, the optimal solution for LCA and LCGA contained more classes compared with LCGMM. CONCLUSION: Although LCGA and LCGMM seem to be preferable above the more simple methods, all classification methods should be applied with great caution.
OBJECTIVE: In the analysis of data from longitudinal cohort studies, there is a growing interest in the analysis of developmental trajectories in subpopulations of the cohort under study. There are different advanced statistical methods available to analyze these trajectories, but in the epidemiologic literature, most of those are never used. The purpose of the present study is to compare five statistical methods to detect developmental trajectories in a longitudinal epidemiological data set. STUDY DESIGN AND SETTING: All five statistical methods (K-means clustering, a "two-step" approach with mixed modeling and K-means clustering, latent class analysis [LCA], latent class growth analysis [LCGA], and latent class growth mixture modeling [LCGMM]) were performed on a real-life data set and two manipulated data sets. The first manipulated data set contained four different linear developments over time, whereas the second contained two linear and two quadratic developments. RESULTS: For the real-life data set, all five classification methods revealed comparable trajectories. Regarding the manipulated data sets, LCGA performed best in detecting linear trajectories, whereas none of the methods performed well in detecting a combination of linear and quadratic trajectories. Furthermore, the optimal solution for LCA and LCGA contained more classes compared with LCGMM. CONCLUSION: Although LCGA and LCGMM seem to be preferable above the more simple methods, all classification methods should be applied with great caution.
Authors: Mahmood R Gohari; Joel A Dubin; Richard J Cook; Scott T Leatherdale Journal: Health Promot Chronic Dis Prev Can Date: 2019-09 Impact factor: 3.240
Authors: S C Kim; N Choudhry; J M Franklin; K Bykov; M Eikermann; J Lii; M A Fischer; B T Bateman Journal: Osteoarthritis Cartilage Date: 2017-04-19 Impact factor: 6.576