Literature DB >> 36164545

Great diversity in the utilization and reporting of latent growth modeling approaches in type 2 diabetes: A literature review.

Sarah O'Connor1,2, Claudia Blais2,3, Miceline Mésidor4,5, Denis Talbot4,5, Paul Poirier1,2, Jacinthe Leclerc1,2,6.   

Abstract

Introduction: The progression of complications of type 2 diabetes (T2D) is unique to each patient and can be depicted through individual temporal trajectories. Latent growth modeling approaches (latent growth mixture models [LGMM] or latent class growth analysis [LCGA]) can be used to classify similar individual trajectories in a priori non-observed groups (latent groups), sharing common characteristics. Although increasingly used in the field of T2D, many questions remain regarding the utilization of these methods. Objective: To review the literature of longitudinal studies using latent growth modeling approaches to study T2D.
Methods: MEDLINE (Ovid), EMBASE, CINAHL and Wb of Science were searched through August 25th, 2021. Data was collected on the type of latent growth modeling approaches (LGMM or LCGA), characteristics of studies and quality of reporting using the GRoLTS-Checklist and presented as frequencies.
Results: From the 4,694 citations screened, a total of 38 studies were included. The studies were published beetween 2011 and 2021 and the length of follow-up ranged from 8 weeks to 14 years. Six studies used LGMM, while 32 studies used LCGA. The fields of research varied from clinical research, psychological science, healthcare utilization research and drug usage/pharmaco-epidemiology. Data sources included primary data (clinical trials, prospective/retrospective cohorts, surveys), or secondary data (health records/registries, medico-administrative). Fifty percent of studies evaluated trajectory groups as exposures for a subsequent clinical outcome, while 24% used predictive models of group membership and 5% used both. Regarding the quality of reporting, trajectory groups were adequately presented, however many studies failed to report important decisions made for the trajectory group identification.
Conclusion: Although LCGA were preferred, the contexts of utilization were diverse and unrelated to the type of methods. We recommend future authors to clearly report the decisions made regarding trajectory groups identification.
© 2022 The Author(s).

Entities:  

Keywords:  Care trajectory; Diabetes mellitus; Epidemiology; Group-based trajectory modeling; Health-care utilization; Latent class growth modeling

Year:  2022        PMID: 36164545      PMCID: PMC9508412          DOI: 10.1016/j.heliyon.2022.e10493

Source DB:  PubMed          Journal:  Heliyon        ISSN: 2405-8440


Introduction

Type 2 diabetes (T2D) is a major public health threat worldwide, with approximately 1 person out of 11 living with the disease in 2019 [1]. To prevent complications, the management of T2D requires a long-term control of glycemia, healthy lifestyle behaviors, adequate medication usage as well as regular medical examinations [1]. However, in a real-life context, the management of T2D can differ between patients due to individual and environmental factors, but may also change in time (e.g. changes in treatment guidelines, preferences, physicians, living contexts, etc.). This heterogeneity may be reflected through different longitudinal trajectories in adherence patterns, glycemic control, lifestyle behaviors or healthcare utilization, unique to each patient. Latent growth modeling approaches are statistical methods that consider individuals being part of a heterogeneous population, composed of unobserved groups of individual trajectories sharing similar characteristics [2, 3]. The goal of latent growth modeling approaches is to estimate a given number of a priori unobserved latent groups within a population, based on the probability of membership of individuals to a specific trajectory group [2]. Two distinct methods of latent growth modeling approaches can be identified, namely Latent Growth Mixture Modeling (LGMM) and Latent Class Growth Analysis (LCGA) [LCGA is also known as Latent Class Growth Modeling (LCGM) or Group-Based Trajectory Modeling (GBTM)] [4]. LGMM allows the modelization of within-class heterogeneity [5]. In contrast, LCGA assumes that no within-group heterogeneity remains within the trajectory groups [2]. Latent growth modeling approaches, which have been initially developed in the fields of social sciences, psychopathology or criminology [3] are now increasingly used in the field of T2D. These approaches are well suited for the study of T2D given the repeated follow-ups in time, and also encourage the consideration of the overall temporal trajectories of care for clinical decisions [6, 7]. The latent groups may help identifying subgroups of the population with different needs and developing adapted care strategies [7]. The possibility of using these approaches with multiple types of data, long follow-up lengths and unstructured datasets makes these methods highly appealing in the study of T2D [3]. However, latent growth modeling approaches are complex to use; many decisions must be made and should be reported correctly in order to ensure comparability between studies [4]. Accordingly, the Guidelines for Reporting on Latent Trajectory Studies (GRoLTS) checklist was build up for maximised transparency and to improve comparison between studies [4]. Although there is a growing number of studies using latent growth modeling approaches in the field of T2D, the variety of methods used, their context of utilization and quality of reporting remain unknown. Hence, by identifying longitudinal studies using latent growth modeling approaches among individuals with T2D, we sought (1) to identify the latent growth modeling approaches used, (2) to describe the context of utilization of the different methods, and (3) to assess the quality of reporting, using the GRoLTS checklist framework [4]. Of note, this review did not seek to describe the methods per se, which have been done previously [2, 5, 8]. To our knowledge, this review is the first to comprehensively describe the latent growth modeling approaches used in the field of T2D as of December 2021.

Materials and methods

Eligibility criteria

This study is a review of longitudinal studies using latent growth modeling approaches among human cases with T2D, as defined by the authors. Studies needed to have at least 50% of participants with T2D to be included. Eligible studies must have used a method for discriminating latent groups of individual trajectories, no matter the duration of the follow-up. Studies determining groups using standard/arbitrary thresholds (e.g. body mass index categories, age groups, etc.) or sample stratifications (quartiles, percentiles, medians, interquartile ranges) were excluded. Only published peer-reviewed studies written in English or French were included. Case-control, cross-sectional and qualitative studies as well as reviews, grey literature and conference abstracts were excluded.

Search of the studies

MEDLINE (Ovid), EMBASE, CINAHL and Web of Science were systematically searched through August 25th, 2021, using a prespecified search strategy available in the Supplemental material. Citations were combined and duplicates were removed in Endnote 20 software (Clarivate, London, United Kingdom). Selection and full-text eligibility were performed by one reviewer (SO) in Covidence software (Melbourne, Australia). Since some published articles shared the same population/data sources but had different primary outcomes and methods, they were considered as two separated studies. However, any article using the same data source with duplicate methods and identical objectives were considered as only one study.

Data extraction

Data extraction was performed by one reviewer (SO) using a pre-piloted form in Excel software (Microsoft, Redmond, Washington, USA). Extracted data included: (1) Type of method: Method as identified in the studies and associated references, (2) Context of utilization: Study design, year of publication, data source(s), objective(s), field of research, population (incident or prevalent cases of T2D), number of subjects, exposure(s)/outcome(s) of interest, time-points, follow-up periods, other statistical method(s) (descriptive analysis, trends, predictive models, explanatory models, trajectory groups as exposure or outcomes) and, (3) Quality of reporting using the GRoLTS checklist [4]. The GRoLTS checklist is composed of 16 items, covered by 21 questions [4]. In addition, we collected information about actions taken for preventing indication bias and immortal time bias. We tabulated the results in summary tables inspired by Jandoc et al. [9].

Results

Characteristics of included studies

From the 4,694 citations screened, a total of 38 publications were included in this review (Figure 1). The characteristics of studies are summarized in Table 1 and Table 2. The studies were published between 2011 and 2021, from which 76% were published between 2016 and 2021. Sample sizes varied from 84 to 677,618 participants and total follow-up ranged from 8 weeks to 14 years. However, 95% of studies had a total follow-up of ≥1 year and 57%, ≥5 years. The variables used for modeling trajectory groups varied from: (1) biomarkers/clinical tests (glycated hemoglobin A1c (HbA1c) [n = 12], blood glucose levels [n = 2], blood pressure [n = 2], estimated glomerular filtration rate (eGFR) [n = 2], body mass index [n = 2], fundus photography [n = 1]), (2) drug usage outcomes (medication event monitoring system caps [n = 1], proportion of days covered [n = 2], medication possession ratio [n = 1]), (3) healthcare utilization indicators (vaccination [n = 1], medical visits [n = 2]), (4) patient reported outcomes (depressive symptoms [n = 2], anxiety symptoms [n = 2], diabetes distress symptoms [n = 1], self-reported health [n = 1], quality of life [n = 1]), along with other types of variables (step counts, utilization of cellphone application, hypoglycemic episodes). Eight studies included only incident cases of T2D, while 2 studies recruited participants with a diagnosis of T2D in the last 2 years. The other studies included participants with various lengths of T2D duration (prevalent cases).
Figure 1

PRISMA 2020 Flow diagram of included studies [10].

Table 1

Characteristics of studies using latent growth modeling approaches among patients with type 2 diabetes.

StudyData source(s) and typeObjectivesField/context of researchT2D populationExposures evaluated and toolsOutcome evaluated and toolsMethods used for research questionTime scaleMethod used
Latent growth mixture modeling
de Vries McClintock 2016 [17,18]Cohort: Brief Intervention to Improve Adherence through Integrated Management of type 2 Diabetes Mellitus and Depression Treatment (2010–2011)Type of data source: Primary data (clinical trial)To understand:1- The course of oral hypoglycemic agent adherence patterns over 12 weeks among T2D patients in primary care.2-Whether such patterns are related to patient characteristics.3-Whether patterns predict glycemic control at 12 weeks.Drug utilization research/pharmaco-epidemiologyPrevalent cases(n = 182)Integrated Care interventionTool: Educational intervention by integrated care managersExplicative model: Adherence trajectoriesAdherence trajectoriesTool: Medication Event Monitoring System capsExplicative model: HbA1c at 12 weeks after trajectory analysisTool: in2it A1C AnalyzerComparison between groups: X2 and ANOVAExplicative model: multivariate logistic regressionTime-point studied: 1 weekTrajectory identification: 12 weeksOutcome assessment: at week 12Follow-up period: 12 weeksGeneral growth curve mixture modeling [21, 22, 23, 24, 25, 26]
Hertroijs 2018 [14]Cohort:Zwolle Outpatient Diabetes Project Integrating Available Care (2006–2013); Primary care groupZIO (validation cohort) (2009–2014)Type of data source: Secondary data (electronic health records)1-To identify, predict and validate distinct glycemic trajectories among patients with newly diagnosed T2D treated in primary care.Clinical researchIncident cases (n = 10,528)Predictors of trajectoriesTool: Sociodemographic factors, clinical/laboratory measures, comorbidities, family medical historyHbA1c trajectoriesTool: Not reportedComparison between groups: ANOVA, X2Validation or predictive model: multinomial logistic backward regressionTime-point studied: 1 year (±3 months)Follow-up period: 4–5 yearsLatent growth mixture modelling [2, 3, 27]
Laiteerapong 2017 [16]Cohort: Kaiser Permanente Northern California Diabetes Registry (1997–2012)Type of data source: Secondary data (electronic health records, medico-administrative)1-To classify trajectories of long term HbA1c values after diagnosis of T2D.2- To examine each trajectory's associations with subsequent microvascular and macrovascular events and mortality.Clinical researchIncident diabetes (n = 28,016) (n = 25,732 for outcomes)HbA1c trajectories Tool: Not reportedDiabetes complicationsTool: ICD-9 and procedures codesExplicative model: Multivariate Cox proportional hazard regression modelTime-point studied: 1 yearTrajectory identification: 10 yearsOutcome assessment: 4 yearsTotal follow-up period: 14 yearsLatent growth mixture modeling [24, 28]
Wang 2011 [15]Cohort: San Antonio Longitudinal Study of Aging (1992–1996 to 2000–2005)Type of data source: Primary data (prospective cohort)1-To examine whether better glycemic control improves the maintenance of lower-extremity physical function over a 36-month period among participants with diabetes.Clinical researchPrevalent cases (n = 119)HbA1c trajectoriesTool: Not reportedLower-extremity functionTool: Short physical performance batteryComparison between classes:Pseudo-class estimation techniqueExplicative model: Mixture path modeling with propensity scoreTime-point studied: 6 monthsOutcome assessment: 18 and 36 monthsTotal follow-up period: 36 monthsLatent growth mixture modeling [22, 29]
Whitworth 2017 [12]Cohort: Fremantle Diabetes Study Phase II (2008–2011, follow-up until 2016)Type of data source: Primary data (prospective cohort)1-To describe the long-term trajectories of depression symptom severity in people with T2D.2-To identify predictors and associates of these trajectories.Psychological sciencePrevalent cases (n = 1,201)Predictors of trajectories: Clinical, biochemical, demographic, depression, psychosocial dataTrajectories of depression symptomsTool: Patient Health QuestionnaireComparison between groups: One-Way ANOVA, X2, Mann-Whitney U test, Kruskal-Wallis testPredictive model: Binary logistic regression modelTime-point studied: 2 yearsTrajectory identification follow-up: 4 yearsTotal follow-up: 4 yearsLatent class growth analysis [30]
Whitworth 2020 [13]Cohort: Fremantle Diabetes Study Phase II (2008–2011, follow-up until 2016)Type of data source: Primary data (prospective cohort)1-To identify distinct trajectories of anxiety symptoms in individuals with T2D over time.2-To identify demographic, self-management and clinical predictors of anxiety trajectory membership.3-To assess whether having a lifetime history of anxiety or depression predicts anxiety trajectory membership beyond significant demographic predictors.4-To examine whether anxiety trajectory membership is associated with important self-management and clinical outcomes, after controlling for these variables at baseline.Psychological sciencePrevalent cases (n = 1,549)Obj 1–3: Predictors of trajectories: Clinical, biochemical, demographic, depression, psychosocial dataObj 4: Trajectories of anxiety symptom severityTool: Generalized Anxiety Disorder ScaleObj 1–3: Trajectories of anxiety symptom severityTool: Generalized Anxiety Disorder ScaleObj 4: Self-management and clinical variables at year 4Tool: Frequency of self-monitoring of blood glucoseComparison between groups: X2, one-way ANOVA, Kruskal-WallisPredictive model: Bivariate logistic regression model (unadjusted and adjusted)Explicative model: Linear regression model, adjustment for confounding factorsTime-point studied: 2 yearsTrajectory identification follow-up: 4 yearsTotal follow-up: 4 yearsLatent growth Mixture modeling [30]
Latent class growth analyses
Bayliss 2011 [31]Cohort: Health maintenance organization cancer registry (1998–2008)Type of data source:Secondary data (Medico-administrative)1-To assess the effect of incident stage 0, 1, 2 or 3 breast, colon or prostate cancer; incident depression; or an exacerbation of COPD on control of T2D.Clinical researchPrevalent cases of T2D and incident cancer/depression or COPD exacerbation (n = 5,883)Months since diagnostic of cancerTool: ICD-9, prescriptions, hospitalizationsHbA1c trajectoriesTool: Not reportedDescriptive analyses for comparing trajectory groups (graphs)Time-point studied: 3 monthsTrajectory identification: 18 monthsTotal follow-up: 84 monthsLatent class growth modeling [32]
Bocquier 2019 [33]Cohort: Permanent Sample of Beneficiaries (2006–2015)Type of data source: Secondary data (Medico-administrative)1-To identify temporal trajectories of seasonal influenza vaccination uptake2- To describe their clinical characteristics.Healthcare utilization researchPrevalent cases (n = 15,766)Predictors of trajectoriesTools: Demographic, clinical, and healthcare utilization factorsTrajectories of seasonal influenza vaccinationTool: binary variable for each influenza seasonComparison between groups: ANOVA, X2Predictive model: multivariate logistic regressionTime-point studied: 1 yearFollow-up period: 10 yearsGroup-based trajectory modeling [3, 34, 35, 36]
Botvin Moshe 2020 [37]Cohort: Israel Diabetes and Cognitive Decline study (from 1998, years not reported)Type of data source: Primary data (prospective cohort, registry)1-To investigate the associations of long-term measurements of body mass index with indices of carotid stiffness and atherosclerosis among non-demented diabetes patients.Clinical researchPrevalent cases (n = 471)Body mass index trajectoriesTool: body mass index equationCarotid intime-media thickness, distensibility coefficient and elastography strain ratio, carotid plaque volumeTool: Carotid ultrasound DopplerComparison between groups: t-tests, X2, ANOVA, WilcoxonExplicative model: linear regression model or logistic regression modelTime-point studied: 1 monthTrajectory identification follow-up: 120 months prior baselineOutcome assessment: 36 months after baselineMultinomial modeling strategy [34]
Chen 2016 [38]Cohort: National Health insurance claims data (2002–2008)Type of data source: Secondary data (Medico-administrative)1-To identify medication adherence trajectories among patients with newly diagnosed diabetes2-To examine the association of continuity of care and medication adherence among various adherence trajectories.Healthcare utilization researchIncident case (n = 12,123)Continuity of careTool: Continuity of care indexAdherence trajectoriesTool: Medication possession ratioExplicative model: Multivariate logistic regression modelTime-point studied: 1 yearFollow-up period: 6 yearsGroup-based trajectory modeling [32, 34, 35, 39]
Chiu 2013 [19]Cohort: Health and Retirement Study (1992–2002)Type of data source: Primary data (retrospective cohort)1-To identify the main patterns of weight and disability trajectories experienced by middle aged and older adults with diabetes.2-To identify the proportion of each trajectory in the population.3-To examine the association between weight trajectories and disability trajectories later in life, as well as baseline sociodemographic, clinical, behavioral, and diabetes-related factors.Clinical researchPrevalent cases (self reported) (n = 1,064)Body mass index trajectoriesTool: body mass index equationDisability trajectoriesTool: Activities of daily life score; Instrumental activities of daily lifeComparison between classes: ANOVA and X2Dual trajectory modeling: conditional probability of membership across trajectory groups.Time-point studied: 1.5–2 yearsFollow-up period: 10 yearsGroup-based semi-parametric mixture modeling approach and dual trajectory modeling [34, 34, 40, 41]
Chiu 2017 [42]Cohort: Taiwan Longitudinal Study on Aging (1996–2007)Type of data sources: Primary data (survey)1-To identify distinct trajectories of depressive symptoms after diagnosis of diabetes in middle-aged and older adults.2-To ascertain the proportion of adults in each trajectory.3-To examine antecedent and subsequent factors associated with each trajectory.Psychological scienceIncident cases, no diabetes in 1996, but diabetes in 1999 (self-reported) (n = 487)Predictive model: Predictors of trajectories: laboratory and clinical measures, socio-demographic measures, mobility score, self-rated health, comorbidities, lifestyle behaviorsExplicative model: trajectory groupsPredictive model: Depression symptoms trajectoriesTool: Center of Epidemiological Studies – Depression scaleExplicative model: future disabilityTools: Activities of daily life score; Instrumental activities of daily life Nagi's mobility itemComparison between groups: X2 and ANOVAPredictive model: Multinomial logistic regressionExplicative model: Multiple regressionTime-point studied: 1 yearTrajectory identification: 8 yearsOutcome assessment: 2007Latent class growth modeling [32, 35, 39]
Cooke 2020 [43]Cohort: SMARTER randomized controlled trial (2012–2016)Type of data source: Primary data (clinical trial)1-To examine indicators of trajectory membership of both steps/day and changes from baseline steps/day over the 1-year intervention.Clinical researchOverweight/obese adults with T2D and/or hypertension (n = 118)Predictors of trajectory group membershipTools: Baseline sociodemographic, T2D, time of intervention, anthropometric, clinical dataTrajectories of mean septs/dayTool: Average steps/day on a 30-day periodPredictive model: Cumulative logistic regressionTime-point studied: 30 daysFollow-up period: 1 yearGroup-based trajectory modeling [3, 34]
Davis 2016 [44]Cohorts: Fremantle Diabetes Study Phase 1 (1993–1996) and Western Australia data linkage system (1998–2001)Type of data source: Primary data (prospective cohort)1-To determine whether there was a mortality benefit of tight glycemic control beyond the period in which it was implemented in recently diagnosed patients; a neutral or increased risk of death in those with long-duration diabetes.Clinical researchPrevalent cases (n = 531)Subgroups according to diabetes durationHbA1c trajectoriesTool: Not reportedMortality at 5 yearsTool: Not reportedExplicative model: Kaplan-Meier and multivariate Cox proportional modelTime-point studied: 1 yearOutcome assessment: at year 5Total follow-up period: 5 yearsSemi-parametric group-based modeling strategy [34, 45, 46]
Davis 2016 [47]Cohorts: Fremantle Diabetes Study Phase 1 (1993–1996) and Western Australia data linkage system (1998–2001)Type of data source: Primary data (prospective cohort)1-To investigate the association between estimated GFR and all-cause mortality, including the contribution of temporal eGFR changes.Clinical researchPrevalent cases (n = 1,296)eGFR trajectoriesTool: Laboratory measuresMortality at year 5Tool: Linkage to Western Australia Data Linkage System, all death registration and hospitalisationsComparison between groups: Fisher exact testExplicative model: Kaplan-Meier and multivariate Cox proportional modelTime-point studied: 1 yearOutcome assessment: at year 5Total follow-up period: 5 yearsSemi-parametric group-based modeling [45, 46]
Deschênes 2018 [48]Cohort: Evaluation of Diabetes Treatment study (2011–2014)Type of data source: Primary data (surveys)1-To examine latent longitudinal trajectories of anxiety symptoms in adults with T2D and their associations with incident cardiovascular disease.Psychological sicencePrevalent cases (n = 832)Trajectories of anxiety symptomsTool: Generalized AnxietyDisorder ScaleCardiovascular disease at 24 months, 36 months, 48 months.Tool: Diabetes Complications Index (DCI)Comparison between groups: X2Explicative model:Univariate and multivariate logistic regression modelTime-point studied: 1 yearTrajectory identification follow-up: 2 yearsOutcome assessment: 3 yearsTotal follow-up period: 5 yearsSemi-parametric latent class growth modeling [34, 39]
Goh 2015 [49]Cohort: Not reported (primary care clinic northeastern part of Singapore) (2013–2014)Types of cohorts: Primary data (clinical trial)1-To identify and describe short-term trajectories of use of the Interactive Diet and Activity Tracker app in a primary care setting 2-To identify patient characteristics associated with each trajectory.Clinical researchPrevalent cases (n = 84)Predictors of trajectories: Laboratory and clinical measures, sociodemographic measures, lifestyle behaviorsTrajectories of utilization of a telephone applicationTool: monitored weeklyComparison between groups and predictive model: Univariate/multivariate stepwise polytomous logistic regressionTime-point studied: 1 weekFollow-up period: 8 weeksLatent class growth modeling [3, 34, 35, 50]
Lee 2018 [51]Cohort: A medical centre in Taiwan(2009–2012, until 2014)Type of data source: Primary and secondary data (retrospective cohort, hospital records)1-To investigate the effect of changes in fasting plasma glucose variability, as assessed by 2-year trajectories of fasting plasma glucose variability, on mortality risk in patients with T2D.Clinical researchPrevalent cases (n = 3,569)Glucose variabilityTool: mean glucose and coefficient of variation of visit-to-visit of fasting glucose levelMortalityTool: Hospital recordsComparison between groups: X2 and ANOVAExplicative model: Kaplan-Maier & log-rank testCox proportional hazard for cardiovascular mortalityTime-point studied: 3 monthsTrajectory identification: 2 yearsOutcome assessment: 4 yearsFollow-up period: 6 yearsLatent class growth modeling [3, 50, 52]
Li 2018 [53]Cohort: The China Kailuan study (2006–2014)Type of data source:Primary data (retrospective cohort)1-To investigates the effect of long-term systolic blood pressure trajectory on kidney damage in the diabetic population.Clinical researchPrevalent diabetes (n = 4,556)Blood pressure trajectoriesTools: corrected desktop mercury sphygmomanometer & electronic sphygmomanometerOnset of kidney damage in 2014Tool: eGFR, proteinureaComparison between groups: ANOVA, Least Significant Difference test and Dunnett test, X2Explicative model: Multivariate logistic regressionTime-point: 2 yearsTrajectory identification: 8 yearsOutcome assessment: in 2014 onlyTotal follow-up: 8 yearsTrajectory model [3, 34, 54, 55]
Li 2021 [56]Cohort: Taiwan's National Health Insurance Research Database (2002–2003)Type of data source: Secondary data (medico-administrative, electronic health records)1-To investigate associations between exposure to various trajectories of severe hypoglycemic events and risk of dementia in patient with T2D.Healthcare utilization researchPrevalent cases (n = 677,618)Hypoglycemic episodes trajectoriesTool: ICD-9-CM (clinical modification) codesDementiaTool: ICD-9-CM codesExplicative model: Multivariate Cox proportional hazard regression model, with sub-distribution Hazard RatioTime-point studied: 6 months.Trajectory identification: 3 yearsOutcome assessment: average: 6 years, max 7 yearsGroup-based trajectory modeling [34, 39, 54]
Lipscombe 2015 [57]Cohort: Evaluation of Diabetes Treatment study (2011–2014)Type of data source: Primary data (surveys)1-To identify and describe a set of distinct longitudinal trajectories of diabetes distress over 4 years of follow-up time.Psychological sciencePrevalent cases (n = 1,135)Trajectories of diabetes distressTool: Diabetes Distress ScaleCharacteristics: Socio-demographic, mental health, diabetes-related and lifestyle factorsComparison between groups: ANOVA, X2; Bonferroni correctionTime-point studied: 1 yearFollow-up period: 3 yearsLatent class growth modeling [32]
Lo-Ciganic 2016 [58]Cohort: Pennsylvania Medicaid administrative claims data (2007–2011)Type of data source:Secondary data (medico-administrative)1-To examine the association between adherence trajectories for oral hypoglycemics and subsequent hospitalizations among diabetic patients.Pharmacoepidemiology/Drug utilization researchNew users of oral hypoglycemic agents (n = 16,256)Adherence trajectories to oral hypoglycemicsTools: number of days covered by at least one hypoglycemic medication/30 daysHospitalisation related to diabetes/all-cause hospitalisationTools: ICD-9 codes, Current Procedural Terminology procedure codesComparison between groups: X2 and other tests (not mentioned)Explicative model: Multivariate Cox proportional modelTime-point studied: 1 monthTrajectory identification: 1 yearOutcome assessment: 1 yearTotal follow-up period: 2 yearsGroup-based trajectory modeling [3, 8, 39, 45]
Low 2019 [11]Cohort: Name of the cohort not reported (2002–2017)Type of data source: Primary data (prospective cohort)1-To characterize HbA1c trajectories and examine their associations with chronic kidney disease progression.Clinical researchPrevalent cases (n = 770)HbA1c trajectoriesTool: Standard laboratory measuresChronic kidney disease progression over timeTool: Decline of eGFR categoryComparison between groups: X2, one way ANOVA, Mann-Whitney test, t-testExplicative model: Multivariate Cox proportional hazards regressionTime-point studied: 1 yearTrajectory identification follow-up: 8 yearsOutcome assessment: 8 yearsTotal follow-up period: 8 yearsGroup-based trajectory modeling [39, 45]
Luo 2017 [59]Cohort: Singapore Consortium of Cohort Studies Diabetes Cohort (2004–2010)Type of data sources: Primary and secondary data (prospective cohort, medico-administrative, electronic health records)1-To examine longitudinal trends in HbA1c in a multi-ethnic Asian cohort of diabetes patients, 2-To examine the associations of these trends with future risk of acute myocardial infarction, stroke, end stage renal failure and all-cause mortality.Clinical researchPrevalent cases (n = 6,079)HbA1c trajectories after catheterizationTool: Not reportedTrends of change in serum lipidsTool: NRTime-to-event: death, acute myocardial infarction, stroke, end stage renal failure, serum lipids, all-cause mortalityTool: Linkage to National Registry of Disease OfficeComparison between groups: Kruskal-Wallis, X2Explicative model: Multivariate generalized estimating equation, multivariate Cox proportional hazards regressionTime-point studied: 12 monthsTrajectory identification: median of 4.1 years prior recruitmentOutcome assessment: median of 7 years–8.3 years, post-recruitment depending on the outcomeLatent class growth modeling [3, 8, 60]
Luo 2019 [61]Cohorts: Singapore Population Health Studies Diabetic Cohort (2004–2010) and National Healthcare Group Polyclinics (2011–2016)Types of cohorts: primary and secondary data (prospective cohort, electronic health records)1-To analyze diabetes treatment and treatment changes in association with long-term glycemic patterns in an Asian population with diabetes.Clinical research/healthcare utilization researchPrevalent cases (n = 6,218)HbA1c trajectoriesTool: Not reportedAnnual treatment plansTool: Antidiabetic medication, alone or in combinationComparison between groups: ANOVA, Kruskal-Wallis, Pearson X2Comparison between groups in time: Cochrane ArmitageTime-point studied: 1 yearTotal follow-up period: 6 yearsLatent class growth analysis [3, 8, 60]
Niaz 2021 [62]Cohort: Alberta Health databases (2008–2018)Type of data source: Secondary data (medico-administrative)1-To characterize adherence to oral antihyperglycemic medications in the year before a depressive episode2-To examine the association between depression and medication adherence.Pharmaco-epidemiologyNew users of metformin (n = 165,056)No exposition for trajectory modelingTrajectories of adherenceTool: Proportion of days covered to oral antihyperglycemic medications (<80% as threshold)Descriptive analysis: Presentation of trajectories of proportion of days covered prior depression diagnosisTime-point studied: 30 daysFollow-up period: 1 yearGroup-based trajectory modeling [55]
Obura 2020 [63]Cohort: Medicines Initiative-Diabetes Research on Patient Stratification (IMI-DIRECT) StudyType of data source: Primary data (prospective cohort)1-To examine the association between subgroups based on their glucose curves during a five-point mixed-meal tolerance test and metabolic traits at baseline and glycemic deterioration in individuals with T2D.Clinical researchIncident and prevalent cases (within 6–24 months) (n = 789)Glucose curves subgroups following a mixed-meal tolerance testMetabolic traits and glucose deterioration at 18 monthsComparison between groups: general linear modelExplicative model: multivariate general linear model & Bonferroni correctionTime-point studied: 30 minTrajectory identification follow-up: 120 minOutcome assessment: At 18 monthsTotal follow-up: 18 monthsLatent class trajectory analysis (reference not reported)
Raghavan 2020 [6]Cohort: United States veteran healthcare system (2005–2006)Type of data source: Secondary data (medico-administrative, electronic health records)1-To identify glycemic control trajectories.2-To describe the characteristics of patients with distinct HbA1c trajectories3- To compare the associations of HbA1c trajectories and single HbA1c measurements with short-term mortality.Clinical researchPrevalent cases (<2 years since T2D diagnosis) (n = 7,780)HbA1c trajectoriesTool: Not reportedMortalityTool: vital status dataExplicative: Joint latent class survival modelTime-point studied: days/1 monthFollow-up period: 2 yearsJoint Latent class growth analysis [20, 21, 22, 64]
Rathmann 2019 [65]Cohort: Diabetes Patienten Verlaufsdokumentation (years not reported)Type of data source Secondary data (registry)1-To identify groups of heterogeneous HbA1c trajectories over time in newly diagnosed T2D.Clinical researchIncident cases (n = 6,355)Comparison between groups in time: HbA1c trajectoriesPredictive model: Predictors of trajectoriesTool: Diabetes treatment, microvascular complications, sociodemographic factorsComparison between groups in time: Diabetes medication trendsPredictive model: HbA1c TrajectoriesTool: standardized using the multiple of the mean method according to the Diabetes Control and Complications trialComparison between groups: X2, Kruskal-WallisComparison between groups in time: descriptive analysisPredictive model: multinomial logistic regression, Bonferroni-Holm correctionTime-point studied: 1 yearFollow-up period: 5 yearsLatent class growth modeling [3, 66]
Schmitz 2013 [67]Cohort: Montreal Diabetes Health and Well Being Study (2008–2011)Type of data source: Primary data (surveys)1-To identify and describe longitudinal trajectories of self-rated health status in people with diabetes.Clinical researchPrevalent self-reported diabetes (n = 1,288)Trajectories of self-reported healthTool: Likert-scale, with scores from 0 to 100Global functioningTool: World Health Organization Disability Assessment Schedule IIComparison between groups: general linear model, X2Explicative model:multivariate logistic regression modelTime-point studied: 1 yearTrajectory identification follow-up: 3 yearsOutcome assessment: 1 yearTotal follow-up period: 4 yearsSemi-parametric trajectory modeling [34, 35]
Sidorenkov 2018 [68]Cohort: Groningen Initiative to Analyze Type 2 Diabetes Treatment database (2007–2013)Type of data source: Secondary data (medico-administrative, electronic health records)1-To identify subgroups of patients with T2D following distinct trajectories of HbA1c after insulin initiation.2-To explore underlying differences in clinical characteristics.Clinical researchPrevalent and incident cases with insulin initiation (n = 1,459)HbA1c trajectoriesTool: Standard laboratory measuresBaseline characteristics: Laboratory and clinical measures, socio-demographic measures, history of diabetes-related comorbidities, diabetes durationComparison between groups: one way ANOVA, Kruskal-Wallis, X2, Tukey Honestly Significant Difference testTime-point studied: 6 monthsTotal follow-up period: 4 yearsLatent class growth modeling [50]
Tsai 2019 [7]Cohort: Longitudinal Health Insurance Database 2000 (2000–2010)Type of data source: Secondary data (medico-administrative)1-To explore the longitudinal care seeking patterns of diabetic patients.2-To identify baseline characteristics associated with trajectories of care seeking behaviors.Healthcare utilization researchIncident cases (n = 3,987)Predictors of trajectories: Sociodemographic factors, severity of diabetes at diagnosis, diabetes complications, severity indexTrajectories of seeking patternsTool: Regularity of visits to specialized/generalized providers (intervals <90 days for regularity)Comparison between groups: X2 and ANOVAPredictive model: multinomial logistic regressionTime-point studied: 1 yearTotal follow-up period: 11 yearsGroup-based trajectory [34, 39, 40, 69]
Tsai 2019 [70]Cohort: Longitudinal Health Insurance Database 2000 (2000–2010)Type of data source: Secondary data (medico-administrative)1-To investigate diabetes outcomes by long-term trajectories of patients care settings among diabetes patients with regular follow-up.Healthcare utilization researchIncident cases (n = 1,268)Trajectories of care settingsTool: At least 1 visit/yearDiabetes complicationsTool: Invasive procedures using intervention and ICD-9-CM codesExplicative model: Cox proportional hazards with cumulative incidence functionTime-point studied: 1 yearTrajectory identification: 5 yearsOutcome assessment: 5 yearsFollow-up period: 10 yearsGroup-based trajectory modeling [32, 34, 39]
Vistisen 2019 [71]Cohort: Not reported (Steno Diabetes Center Copenhagen, 2001–2017)Type of data source: Secondary data (registry)1-Assessing potential heterogeneity in eGFR development among persons with diabetes and normo-albuminuria after entering stage 3 chronic kidney disease.Clinical researchPrevalent cases (subset of individuals with normo-albuminuria) and Type 1 diabetes (n = 935) or T2D (1,984)eGFR trajectoriesTool: Chronic Kidney Disease Epidemiology Collaboration standard equationBaseline characteristics: Laboratory and clinical measures, socio-demographic measures, duration of diabetes, lifestyle behaviors, medication, retinopathy statusComparison between groups: t-tests, X2Time-point studied: 1 yearFollow-up period: 10 yearsLatent class trajectory modeling [60, 72, 73]
Walraven 2015 [74]Cohort: Diabetes Care System West-Friesland (1998–2001)Type of data source:Primary and secondary data (prospective cohort, electronic health records)1-To identify subgroups of T2D patients with distinct HbA1c trajectories.2-To investigate the prevalence of microvascular complications over time.Clinical researchPrevalent cases (n = 5,423)Comparison between groups in time: HbA1c trajectoriesTool: Standard laboratory measuresPredictive model: predictors of trajectoriesTools: clinical, laboratory measures, sociodemographic factors, lifestyle behaviorsComparison between groups in time: diabetes complications over time, medication use over timeTools: Urinary albumin-creatinine ratio; EURODIAB classificationPredictive model: HbA1c trajectoriesTool: Standard laboratory measureComparison between groups: ANOVA, X2Comparison between groups in time: Binomial mixed modeling approachPredictive model: multinomial logistic backward regressionTime-point studied: 1 yearTotal follow-up period: 9 yearsLatent class growth modeling [8]
Walraven 2015 [75]Cohort: Diabetes Care System West-Friesland (1998–2010)Type of data source: Secondary data (electronic health records)1-To identify subgroups of T2D patients with distinct trajectories of systolic blood pressure levels.Clinical researchPrevalent cases (n = 5,711)Predictive model: predictors of trajectoriesTools: clinical, laboratory measures, sociodemographic factors, lifestyle behaviorsExplicative model: trajectories of blood pressure controlTool: sphygmomanometer/oscillometer device after 2003Predictive model: trajectories of blood pressure controlExplicative model: prevalence of retinopathy & microalbuminuriaTools: fundus photography & EURODIAB classification, urinary albumin-creatinine ratioTool: mortality in death registry (cause of death with ICD-9 codes)Comparison between groups: ANOVA, X2Predictive model: multinomial logistic backward regressionExplicative model:Binomial mixed modeling approach,multivariate Cox proportional modelTime-point studied: 1 yearFollow-up period: max 9 years (mean follow-up of 5.7 years)Latent class growth modeling [3, 76, 77]
Wang 2019 [78]Cohort: Five diabetic clinics in Taiwan (2014–2016)Type of data sources: Primary and secondary data (prospective cohort, medical records)1-To identify quality of life trajectory patterns and the determinants in patients with T2D.Clinical researchPrevalent cases (n = 466)Predictors of trajectories:laboratory and clinical measures, socio-demographic measures, psychosocial factors, lifestyle behaviors, complication of diabetes, insulin useTrajectories of quality of lifeTool: Diabetes-Specific Quality of Life scaleComparison between groups: X2 and ANOVAPredictive model: multinomial logistic regressionTime-point studied: 6 months.Follow-up period: 2 yearsLatent class growth analysis [21, 39, 40]
Zavrelova 2011 [79]Cohort: Diabetes Care System West-Friesland (1998–2006)Type of data source: Secondary data (electronic health records)1-To identify distinct developmental patterns of diabetic retinopathy2-To assess the risk factor levels of patients in these clusters.Clinical researchPrevalent cases (n = 3,343)Diabetic retinopathy developmental patterns (trajectories)Tool: Fundus photography, EURODIAB grading systemBaseline characteristics: laboratory and clinical measures, socio-demographic measuresComparison between groups: ANOVA with post-hoc Bonferroni or X2 tests, Kruskal-WallisTime-point studied: 1 yearFollow-up period: 6 yearsLatent class growth modeling [2, 21]

COPD: Chronic obstructive pulmonary disease, eGFR: Estimated glomerular filtration rate, EURODIAB: European Community funded Concerted Action Programme into the epidemiology and prevention of diabetes; HbA1c: Glycated hemoglobin A1c, ICD = International classification of diseases; T2D: Type 2 diabetes.

as mentioned and cited in text.

Table 2

Characteristics of studies and context of utilization of latent growth modeling approaches in the field of type 2 diabetes.

LGMM, 6 studies
LCGA, 32 studies
Total, 38 studies
NumberProportion, %NumberProportion, %NumberProportion, %
Field/context of research
 Clinical research35022692566
 Pharmaco-epidemiology/drug utilization research1172638
 Research in healthcare utilization00516513
 Psychological science23339513
Utilization of trajectory groups
 Exposure: Comparison of baseline data only00413411
 Exposure: Comparison between groups over time002625
 Exposure: Association to an outcome (explanatory model)35016501950
 Logistic regression model117413513
 Linear regression model001313
 Cox proportional model or survival analysis117825924
 Logistic or linear regression model001313
 Other/multiple models1172638
 Outcome: Descriptive only/trends002625
 Outcome: Predictive model233722924
 Both explanatory and predictive models1171325
Data sources
 Medico-administrative databases00722718
 Trials or prospective/retrospective cohorts4678251232
 Surveys00413411
 Health records/registry117413513
 Medico-administrative database and health records11739411
 Clinical studies & health records00516513
 Mixed data sources001313
Population
 Incident cases of T2D233619821
 Recruitment max 2 years after T2D diagnosis002625
 Prevalent cases of T2D46722692668
 Both, prevalent and incident cases of T2D002625

LCGA: Latent class growth analysis, LGMM: Latent growth mixture modeling, T2D: type 2 diabetes.

PRISMA 2020 Flow diagram of included studies [10]. Characteristics of studies using latent growth modeling approaches among patients with type 2 diabetes. COPD: Chronic obstructive pulmonary disease, eGFR: Estimated glomerular filtration rate, EURODIAB: European Community funded Concerted Action Programme into the epidemiology and prevention of diabetes; HbA1c: Glycated hemoglobin A1c, ICD = International classification of diseases; T2D: Type 2 diabetes. as mentioned and cited in text. Characteristics of studies and context of utilization of latent growth modeling approaches in the field of type 2 diabetes. LCGA: Latent class growth analysis, LGMM: Latent growth mixture modeling, T2D: type 2 diabetes. The objectives of the studies evaluating trajectory groups could be separated in three main categories. (1) The description/comparison of trajectory groups: Twenty-nine (76%) of studies compared baseline characteristics of groups using comparison tests, such as t-tests, chi-square, ANOVA, Wilcoxon or Kruskal-Wallis tests. Three studies evaluated the trends of a specific outcome according to the trajectory groups, either descriptive analyses, Cochrane Armitage test or binomial mixed models. One study depicted trajectory groups in graphs, without further analysis. (2) Predictive model of group membership: Eleven studies used a predictive model (multivariate, bivariate, polytomous or cumulative logistic regression models) to identify predictors of belonging to the different trajectory groups. (3) Trajectory groups as exposition to an outcome of interest: Twenty-one studies used a latent growth modeling approach as a first step to identify exposure groups, then used regression models to test the association between trajectory groups and outcome(s) of interest (univariate and/or multivariate linear or logistic regression models, Kaplan-Meier, Cox proportional hazard model, joint survival analysis). From these studies, two evaluated the exposition trajectory groups and outcomes simultaneously [6, 11], while the rest (n = 19) identified trajectory groups during a given period, then evaluated outcomes during a subsequent period following the end of the trajectory assessment. A total of 6 studies considered within-class heterogeneity by using LGMM [12, 13, 14, 15, 16] or a General Growth Class Mixture Modeling (GGCMM) for randomized intervention [17, 18]. Most studies (n = 32) used LCGA. Although citing the same references, the terminology used by the investigators varied between LCGM, GBTM, LCGA, “Latent class trajectory analyses”, “group-based trajectory”, “group-based semi-parametric mixture modeling approach”, “semi-parametric group-based modeling strategy” or “multinomial modeling strategy”. One study used a dual trajectory modeling, which consisted of identifying trajectory groups for two distinct variables (body mass index and disability trajectories) [19]. One study used a joint LCGA to assess the association between HbA1c trajectories and mortality, in which survival analysis and trajectory group identification was performed simultaneously [20].

Quality of reporting using the GRoLTS checklist

The summary of the GRoLTS checklist items across studies is presented in Table 3. Well-reported items included the metric of time used, the statistical software used, the characteristics and graphical depiction of the final model and confirmation of group membership of the final solution using average posterior probability and/or entropy. Ninety-two percent of studies reported the tools used for trajectory identification, from which 94% used the Bayesian information criterion (BIC). The number of latent groups identified varied between 2 and 7, where most studies identified either 3 (n = 12), 4 (n = 15) or 5 trajectory groups (n = 7). Items partially reported included missing data mechanisms and management, the total number of fitted models tested or the distribution of observed data and the functional forms of trajectories in the final model. The items poorly reported included the consideration of mean and variance of time as well as within-class heterogeneity, the alternative across-class variance-covariance matrices considered, the number of random start values, the number of cases or graphical depiction of other models tested and availability of syntax files. From the 21 questions covered by the GRoLST checklist, the total of elements reported ranged from 3 to 17 within studies while most studies adequately reported 7 to 10 elements. The details of the items for individual studies are available in Supplemental Material.
Table 3

Assessing the quality of reporting of latent growth modeling approaches using the GRoLTS checklist.

ItemsAnswerN%Comments
1. Is the metric of time used in the statistical model reported?Yes38100The metric of time used was reported either in text, graph or both. Most studies used years/months or weeks from baseline, one study used the age of participants and two studies used time prior/after an index date. The spacing between points were also adequately reported.
No00
Unclear00
2. Is information presented about the mean and variance of time within a wave?Yes25Most studies did not mention if variation across individuals' time intervals were present, or if data were analysed as time-unstructured or time-structured. From the 2 studies who considered the variance of time, details on consideration of time variance remains sparse; one study modelled time with both fixed and random effects. One study mention that the function of time was freed across groups.
No3592
Unclear13
3a. Is the missing data mechanism reported?Yes821Eight studies clearly identified missing data mechanisms. Five studies mentioned the causes of missing data, such as loss in follow-up (2 studies), censoring (1 study) or exclusion from dataset (2 studies). Most studies did not report the mechanism of missing data.
No2566
Unclear513
3b. Is a description provided of what variables are related to attrition/missing data?Yes1437Fourteen studies presented a clear comparison between the characteristics of included/excluded individuals or complete/incomplete datasets. Two studies performed sensitivity analyses comparing the models with/without individuals with missing data. The other studies did not describe the variables with missing data.
No2463
3c. Is a description provided of how missing data in the analyses were dealt with?Yes2053Eighteen studies used exclusion of individuals with missing data from the dataset or exclusion of the last follow-up time-points. Two studies used full information maximum likelihood (FIML) estimation and one study used both exclusion and FIML. One study used multiple imputation and two studies explored the mechanism of missing data using model comparison. The other studies did not report how missing data was dealt with.
No1847
4. Is information about the distribution of the observed variables included?Yes1437From the studies who reported the distribution of observed variables, 13 reported a censored normal distribution and one study used a logit distribution.
No2463
5. Is the software mentioned?Yes38100All studies reported the statistical software used for trajectory modeling, 33 of which reported the version used. MPlus software was used in 9 studies, R was used in 5 studies (package lcmm reported in 3 studies); 16 studies used SAS (13 studies reported using the PROC TRAJ procedure), 7 studies used the “traj” plug-in in STATA, 1 study used MLwin.
No00
6a. Are alternative specifications of within-class heterogeneity considered (e.g., LCGA vs. LGMM) and clearly documented? If not, was sufficient justification provided as to eliminate certain specifications from consideration?Yes718Six studies considered within-class heterogeneity using either LGMM or GGCMM. Two studies with unclear reporting added within-class confidence intervals on time intervals, although reporting using LCGA [43, 67] and 1 study mentioned considering within-class heterogeneity in text, although reporting using LCGA.
No2874
Unclear38
6b. Were alternative specifications of the across-class variance-covariance matrix structure considered? If not, was sufficient justification provided as to eliminate certain specifications from consideration?Yes00Three studies mentioned the matrix structure considered; 2 studies assumed an auto-regression correlation and one study assumed a constant variance-covariance structure, without precising the matrix chosen. No study considered alternative variance-covariance structure or justify the utilization of the chosen matrices. The other studies did not report information about across-class variance-covariance matrix structure. To note, LCGA assumes conditional independence of individuals at each point in time.
No38100
7. Are alternative shape/functional forms of the trajectories described?Yes1950From the 19 studies that reported considering alternative shapes of trajectories, linear, quadratic and cubic shapes where the shapes commonly evaluated for trajectory modeling. The shape of trajectories varied from linear functional forms (n = 7), quadratic (n = 4), cubic (n = 4) or mixed shapes (linear and/or quadratic and/or cubic) (n = 6). One study mentioned comparing alternative shapes in the method section, the results of which were not presented. From the remaining 18 studies, 9 studies only reported the trajectory shape(s) of the final solution, and nine studies did not report the final shape of the trajectories (although the shapes could sometimes be guessed from the graphs).
No1847
Unclear13
8. If covariates have been used, can analyses still be replicated?Yes411Thirty-one studies did not report using covariates for the identification of trajectory groups. Seven studies used covariates to predict the growth parameters/class membership. From these, one studies included both fixed and time-varying covariates (either in trajectory modeling or subsequent explanatory modeling), while 5 studies only considered fixed covariates. From our analysis, 4 studies gave sufficient details for replication.
No25
Unclear13
No covariates used3182
9. Is information reported about the number of random start values and final iterations included?Yes13One study reported rerunning the programs with multiple iterations of random start values. One study considered mentioned using a gridsearch function within the method in order to take in account convergence towards local maxima, yet did not mention iterations with random start value. The remaining studies did not report this information.
No3797
10. Are the model comparison (and selection) tools described from a statistical perspective?Yes3592From the 35 studies using model comparison tools, 33 used the BIC, either alone (21 studies) or in combination with other tools (13 studies). The other tools used were AIC, the Bayes factor, Vuong-lo-mendell-rubin likelihood ratio test, Nagin's criteria for adequacy along with other considerations such as posterior-probability, entropy, clinical relevance or interpretability. A total of 16 studies observed a steadily decrease in BIC, 7 studies observed some disagreement between the tools. Thirteen studies considered a minimal sample size threshold for each trajectory, going from >5% of the study population (5 studies); >1% (2 studies); >3% (1 study), >10 individuals per class (1 study). Three studies mentioned their intention to impose a minimum percentage of individuals, without precision regarding the threshold. From the 35 studies, 25 studies were lacking details on the interpretation of tool and how the decision of the number of trajectories was made, 15 of which did not report their decision.
No25
Insufficient13
11. Are the total number of fitted models reported, including a one-class solution?Yes1642A total of 22 studies reported the number of trajectories in models compared, from which 16 included a one-class solution. The maximum number of trajectories tested went from 3 to 8. Six studies used the one trajectory model for comparison.
No one-class solution616
No1642
12. Are the number of cases per class reported for each model (absolute sample size, or proportion)?Yes718Despite not reporting for all the models tested, all studies reported the number or the proportion of participants in each trajectory for the final model.
No3182
13. If classification of cases in a trajectory is the goal, is entropy or the number of misclassifications reported?Yes3387All studies had the goal of classifying individuals in specific trajectories, 7 studies reported calculating entropy of the chosen model and 27 studies reported average posterior probability of class membership, from which 14 studies set a minimal threshold going from >0.5 to >0.8. Five studies used entropy to choose between models.
No513
14a. Is a plot included with the estimated mean trajectories of the final solution?Yes3797The majority of studies presented the trajectory groups for the final solution, while 4 studies presented graphically each model tested (usually presented in supplemental material). For studies using LGMM, confidence intervals or other dispersion measures were not presented. Individual trajectories were presented in one study.
No13
14b. Are plots included with the estimated mean trajectories for each model?Yes411
No3489
14c. Is a plot included of the combination of estimated means of the final model and the observed individual trajectories split out for each latent class?Yes13
No3797
15. Are characteristics of the final class solution numerically described (i.e., means, SD/SE, n, CI, etc.)?Yes3284Most studies presented the baseline characteristics of the different trajectory groups identified.
No616
16. Are the syntax files available (either in the appendix, supplementary materials, or from the authors)?Yes13No study made their syntax publicly available; one study reported the possibility of sharing syntax files by contacting the authors. In the supplemental materials, 3 studies gave additional details on their decisions for trajectory modeling or presented complementary methodological content.
No3797

GRoLTS checklist from van de Schoot et al. (2016) [4].

AIC: Akaike information criterion, BIC: Bayesian information criterion, CI: Confidence intervals, FIML: Full information maximum likelihood, GBTM: group-based trajectory modeling, GGCMM: general Growth Class Mixture Modeling, LCGA: Latent Class Growth Analysis, LCGM: Latent Class Growth Modeling; LGMM: Latent Growth Mixture Modeling, SD: Standard deviation, SE: Standard error.

Assessing the quality of reporting of latent growth modeling approaches using the GRoLTS checklist. GRoLTS checklist from van de Schoot et al. (2016) [4]. AIC: Akaike information criterion, BIC: Bayesian information criterion, CI: Confidence intervals, FIML: Full information maximum likelihood, GBTM: group-based trajectory modeling, GGCMM: general Growth Class Mixture Modeling, LCGA: Latent Class Growth Analysis, LCGM: Latent Class Growth Modeling; LGMM: Latent Growth Mixture Modeling, SD: Standard deviation, SE: Standard error.

Discussion

Major findings

This review depicted the utilization of latent growth modeling approaches in longitudinal studies conducted among individuals with T2D. The heterogeneity in environmental, genetic, socio-economic factors influencing the outcomes of patients with T2D throughout time justifies the relevance of using latent growth modeling approaches [1, 2]. Our results brought evidence on the wide diversity of contexts of utilization, follow-up periods, type of data sources and research questions considered using these methods. In this first literature review on the topic to our knowledge, we observed an overall preference in the choice of LCGA over LGMM. In contrast, we did not observe any context of utilization specific to a method or another. Finally, this review identified some recurrent issues in reporting, notably on consideration of within-group heterogeneity, how the decisions were made or the management of missing data, as examples.

Current applications and limits of latent growth modeling approaches

The research questions covered in the included studies were diverse. Indeed, only a handful of studies limited their analysis to simply identify trajectory groups; most studies used the identified groups to answer subsequent questions. The utilization of trajectory groups as exposures to a clinical outcome was used in half of included studies. However, some issues can be documented with the utilization of trajectory groups as exposures. First, most authors used a two-step approach, by identifying trajectory groups during an initial follow-up period, then using a regression model during a subsequent follow-up period for outcome assessment [6]. This design imposes individuals to survive up to the beginning of the second period in order to be considered in the analysis. Depending on the length of the follow-up, this 2-step approach may introduce an immortal time bias, notably because individuals must be exempt from the outcome of interest (and above all: survive) during the initial period of group identification [6]. Unfortunately, no study presented the potential impact of this immortal time bias on their results. Furthermore, given the broken temporality between group trajectory identification and outcome assessment, the clinical applicability of results is likely reduced [6]. In alternative, two studies evaluated trajectories and clinical outcomes simultaneously. This approach tackles the issue of immortal time bias yet might bring other issues about whether the temporality between the exposure and the outcome is adequately respected. Another issue of using latent classes in regression models is that the statistical uncertainty associated with finding the latent classes is not taken into account in the subsequent analysis. [80] While this may be expected to lead to underestimating the uncertainty of the regression parameters, it has been argued that this process leads to adequate large sample inferences if the data on the outcome are not used for the latent growth modeling step [80]. However, in case of poor classification (i.e.: low average posterior probabilities and/or low entropy), the interpretation of regression parameters using uncertain classes should be done with caution. Second, another issue with the utilization of trajectory groups as exposures is the adjustment of covariates. Given the long follow-up expected in T2D progression, the advent of concurrent events or life changes might affect the occurrence of the outcome of interest [3]. In the context of a long-term follow-up, as the trajectories can vary in time, some covariates may be time-dependent and thus, may act as confounding and/or intermediate factors [80]. In other words, covariates may have an impact on both the exposure and the outcome of interest at some point in time; yet these might be influenced by the trajectory itself and become a mediator in the association. Both ignoring or adjusting for these changing covariates using standard approaches could lead to invalid estimates, especially when trajectory groups are modelized as exposures [80]. Only a few studies clearly reported having considered covariates to identify trajectory groups, from which a minority integrated time-varying covariates in the models. Third, many studies included prevalent cases of T2D with differences in the duration of the disease. The non-consideration of T2D duration, for instance, might create some systematic distortions in the association between trajectory groups and a given outcome. Many strategies can be used for tackling this confounder, notably by restricting to incident cases, which was done by some researchers [16, 58, 65, 70]. Even though improving internal validity, restricting the selection of cases could affect generalizability of results. One study used propensity score matching [15], while other studies adjusted explanatory models with T2D duration or severity of T2D [37, 58, 58, 59, 63, 67]. In sum, although using trajectory groups as exposure to an outcome was the most preferred approach in the included studies, one should recognize its complexity and should be aware of the limits inherent to its utilization.

Needs for improvement in the quality of reporting of latent growth modeling approaches

We identified many recurring issues in the quality of reporting using the GRoLTS checklist. Among the most important issues noted stands the vast utilization of LCGA, without considering potential residual within-group heterogeneity, which could lead to important consequences on the interpretability and inference of results. In accordance, we observed a certain confusion of the “within-group heterogeneity” concept; Some investigators mentioned using models such as LGMM without reporting how within-group heterogeneity was estimated, or on the opposite, mentioned information about within-group heterogeneity while using LCGA. In most cases, adequate reporting on the matter was lacking. Another important aspect in LGMM and LCGA is the utilization of tools/criteria to identify the optimal number of trajectory groups. The BIC was by far the most used tool for model identification, in accordance with what is usually recommended in the literature [4]. Other tools have been proven efficient to support the BIC, such as the Bayes Factor, the Lo-Mendell-Rubin Likelihood ratio test, the bootstrap likelihood ratio test, the Akaike information criterion and other subjective assessments such as clinical relevance, parsimony and interpretability [3, 81]. Although the criteria for model selection were globally well reported, we observed some reporting issues regarding how these tools were actually used for group identification. Indeed, information was often missing on the results of the tools between the models tested (statistical or subjective criterion), how inconsistencies were managed and how the final solution was identified. Another element often missing in the included studies is the reporting of the number of different models tested, including the one-class model. The GRoLST-Checklist recommend testing the one-class model, which could fit the data best in comparison with multiple-classes models. Although adequation criteria (e.g. BIC) are usually used to compare models, these criteria could be misleading and favorize more complex models [72]. The final decision should thus combine adequation criteria with clinical plausibility of one-class versus multiple-classes models. Another strategy would be to visualize individual trajectories from raw data and evaluate if these individual patterns are concordant with the average trajectory identified [72]. This issue highlights all the complexity and subjectivity in the selection of the optimal trajectory class model and further supports the importance of rigorous reporting of the decision process within the scientific literature. We observed that two-third of the included studies used the average posterior probability as the favourite tool to assess model adequacy, while only 5 studies reported entropy. Usually, model adequacy is confirmed by ensuring all average posterior probability exceed a minimal threshold which is recommended at 0.7 [3]. In the included studies, however, 14 studies reported a clear threshold (between 0.5 and 0.8), while 13 studies mentioned considering average posterior probability without detailing how the tool was used. Finally, we observed a major issue with the treatment of missing data. Latent growth modeling approaches are renowned for their flexibility with irregular datasets, where it remains possible to identify trajectories even if some data are missing, but only if these data are missing at random or completely at random [3, 82]. However, a minority of studies clearly reported the nature of missing data [83].

The non-standardized terminology associated with latent growth modeling approaches

In a global perspective, we felt important to emphasize the great diversity in the terminology related to latent growth modeling approaches. During the process of this review, we also encountered the term “trajectory” in a wide variety of contexts, sometimes as statistical methods, as indicators of processes of care, as a description of a prospective phenomenon or sometimes purely arbitrary. This non-standardized terminology illustrates the importance of consistency in trajectory evaluation, in a way of clarifying and improving their use in future studies. Overall, we observed some issues relating to transparency and quality of report of latent growth modeling approaches, which unfortunately limit the comparability between studies [4]. This lack of information might reflect some confusion in the utilization of these methods, which further justifies the presence of experienced biostatisticians while planning latent growth modeling approaches, no matter the method used.

Strengths and limitations

This review has strengths, notably the utilization of a rigorous and objective methodology inspired by the Cochrane handbook for systematic reviews of interventions [84]. However, the review has limitations, notably the absence of two reviewers for abstracts and studies selections as well as data collection/extraction. Regarding the review process, we restricted to peer-reviewed published studies only. However, given the novelty of latent growth modeling approaches in the field of type 2 diabetes, some studies reported in conference abstracts or grey literature may have been missed. The review may thus be subject to publication bias. The generalizability of the findings is limited to T2D, even though similarities could be expected with other cardiometabolic/chronic diseases such as hypertension or dyslipidemia. Finally, we decided to exclude studies on type 1 diabetes due to the differences of its inherent pathophysiology and clinical management compared to T2D.

Conclusions

In conclusion, this literature review reports the relevance, the application and the issues related to latent growth modeling approaches in the study of T2D. We encourage future investigators interested into latent growth modeling approaches to consider the following suggestions: (1) to use caution when trajectory groups are treated as exposure in explanatory models, (2) to carefully choose the terminology used to characterize trajectory models in light of current literature, (3) to acknowledge the complexity behind latent growth modeling approaches by involving an experienced statistician in the process and finally, (4) to ensure that methods are reported transparently and comprehensively.

Declarations

Author contribution statement

All authors listed have significantly contributed to the development and the writing of this article.

Funding statement

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Data availability statement

Data will be made available on request.

Declaration of interest's statement

The authors declare the following conflict of interests: Sarah O'Connor had received a prize award (unrelated to this work) from the Faculty of Pharmacy, Université Laval, the funds were provided by Pfizer Canada (June 2021). Dr Poirier was a member of the Clinical Practice Guidelines of the Canadian Diabetes Association (macrovascular complications). Although not relevant to this work, Dr Paul Poirier declared having received fees for CME/consultants from Abbott, Amgen, Astrazeneca, Bayer, Bausch Health, Boehringer Ingelheim, Eli Lilly, HLS Therapeutics Inc, Janssen, Novartis, NovoNordisk, Sanofi and Servier. Jacinthe Leclerc is a professor of nursing at Université du Québec à Trois-Rivières. Within her role of professor, she provides (1) Continuous Medical Education sessions for health care professionals, accredited by the Fédération des médecins omnipraticiens du Québec and its local affiliates and (2) statistical expertise on Data Safety Monitoring Board Committees managed by JSS Medical Research (both unrelated to this work). Other authors declare no conflict of interests.

Additional information

No additional information is available for this paper.
  62 in total

1.  Progressive Decline in Estimated Glomerular Filtration Rate in Patients With Diabetes After Moderate Loss in Kidney Function-Even Without Albuminuria.

Authors:  Dorte Vistisen; Gregers Stig Andersen; Adam Hulman; Frederik Persson; Peter Rossing; Marit Eika Jørgensen
Journal:  Diabetes Care       Date:  2019-06-20       Impact factor: 19.112

2.  Regularized Latent Class Model for Joint Analysis of High-Dimensional Longitudinal Biomarkers and a Time-to-Event Outcome.

Authors:  Jiehuan Sun; Jose D Herazo-Maya; Philip L Molyneaux; Toby M Maher; Naftali Kaminski; Hongyu Zhao
Journal:  Biometrics       Date:  2018-12-05       Impact factor: 2.571

3.  Patterns of Adherence to Oral Hypoglycemic Agents and Glucose Control among Primary Care Patients with Type 2 Diabetes.

Authors:  Heather F de Vries McClintock; Knashawn H Morales; Dylan S Small; Hillary R Bogner
Journal:  Behav Med       Date:  2014-10-10       Impact factor: 3.104

4.  Severe diabetes complications among patients with diabetes with regular follow-up: Does care setting matter?

Authors:  Tzu-Ho Tsai; Nicole Huang; I-Feng Lin; Yiing-Jenq Chou
Journal:  J Eval Clin Pract       Date:  2019-11-06       Impact factor: 2.431

5.  Integrating person-centered and variable-centered analyses: growth mixture modeling with latent trajectory classes.

Authors:  B Muthén; L K Muthén
Journal:  Alcohol Clin Exp Res       Date:  2000-06       Impact factor: 3.455

6.  Determinants for quality of life trajectory patterns in patients with type 2 diabetes.

Authors:  Ruey-Hsia Wang; Kuan-Chia Lin; Hui-Chun Hsu; Yau-Jiunn Lee; Shyi-Jang Shin
Journal:  Qual Life Res       Date:  2018-10-01       Impact factor: 4.147

7.  Measures of discrimination for latent group-based trajectory models.

Authors:  Nilesh H Shah; Alison E Hipwell; Stephanie D Stepp; Chung-Chou H Chang
Journal:  J Appl Stat       Date:  2015-01       Impact factor: 1.404

8.  Group-based trajectory models: a new approach to classifying and predicting long-term medication adherence.

Authors:  Jessica M Franklin; William H Shrank; Juliana Pakes; Gabriel Sanfélix-Gimeno; Olga S Matlin; Troyen A Brennan; Niteesh K Choudhry
Journal:  Med Care       Date:  2013-09       Impact factor: 2.983

9.  Variation in the 11-year trajectories of medical care seeking behaviors in diabetes patients under a single payer system: persisting gaps to be filled.

Authors:  Tzu-Ho Tsai; Nicole Huang; I-Feng Lin; Yiing-Jenq Chou
Journal:  BMC Health Serv Res       Date:  2019-08-19       Impact factor: 2.655

10.  Post-load glucose subgroups and associated metabolic traits in individuals with type 2 diabetes: An IMI-DIRECT study.

Authors:  Morgan Obura; Joline W J Beulens; Roderick Slieker; Anitra D M Koopman; Trynke Hoekstra; Giel Nijpels; Petra Elders; Robert W Koivula; Azra Kurbasic; Markku Laakso; Tue H Hansen; Martin Ridderstråle; Torben Hansen; Imre Pavo; Ian Forgie; Bernd Jablonka; Hartmut Ruetten; Andrea Mari; Mark I McCarthy; Mark Walker; Alison Heggie; Timothy J McDonald; Mandy H Perry; Federico De Masi; Søren Brunak; Anubha Mahajan; Giuseppe N Giordano; Tarja Kokkola; Emmanouil Dermitzakis; Ana Viñuela; Oluf Pedersen; Jochen M Schwenk; Jurek Adamski; Harriet J A Teare; Ewan R Pearson; Paul W Franks; Leen M 't Hart; Femke Rutters
Journal:  PLoS One       Date:  2020-11-30       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.