| Literature DB >> 17880699 |
Mariel M Finucane1, Jeffrey H Samet, Nicholas J Horton.
Abstract
Longitudinal studies are helpful in understanding how subtle associations between factors of interest change over time. Our goal is to apply statistical methods which are appropriate for analyzing longitudinal data to a repeated measures epidemiological study as a tutorial in the appropriate use and interpretation of random effects models. To motivate their use, we study the association of alcohol consumption on markers of HIV disease progression in an observational cohort. To make valid inferences, the association among measurements correlated within a subject must be taken into account. We describe a linear mixed effects regression framework that accounts for the clustering of longitudinal data and that can be fit using standard statistical software. We apply the linear mixed effects model to a previously published dataset of HIV infected individuals with a history of alcohol problems who are receiving HAART (n = 197). The researchers were interested in determining the effect of alcohol use on HIV disease progression over time. Fitting a linear mixed effects multiple regression model with a random intercept and random slope for each subject accounts for the association of observations within subjects and yields parameters interpretable as in ordinary multiple regression. A significant interaction between alcohol use and adherence to HAART is found: subjects who use alcohol and are not fully adherent to their HIV medications had higher log RNA (ribonucleic acid) viral load levels than fully adherent non-drinkers, fully adherent alcohol users, and non-drinkers who were not fully adherent. Longitudinal studies are increasingly common in epidemiological research. Software routines that account for correlation between repeated measures using linear mixed effects methods are now generally available and straightforward to utilize. These models allow the relaxation of assumptions needed for approaches such as repeated measures ANOVA, and should be routinely incorporated into the analysis of cohort studies.Entities:
Year: 2007 PMID: 17880699 PMCID: PMC2147003 DOI: 10.1186/1742-5573-4-8
Source DB: PubMed Journal: Epidemiol Perspect Innov ISSN: 1742-5573
Figure 1Observed log(RNA+1), adherence and abstinence status over time for 9 subjects.
Figure 2Hypothetical observed and predicted lines for two subjects from random intercept and random slope model.
Characteristics of the HIV-ALC Cohort on HAART at baseline (n = 197)
| Percent | Count | |
| Primary HIV risk factor | ||
| Men sex with men | 21% | 42 |
| Injection drug use | 58% | 115 |
| Heterosexual sex | 20% | 40 |
| Race/ethnicity | ||
| Black | 41% | 80 |
| White | 37% | 73 |
| Latino | 22% | 43 |
| Other | 1% | 1 |
| Uses alcohol | 40% | 79 |
| Female | 18% | 36 |
| Homeless | 22% | 43 |
| Enrollment year | ||
| 1997 | 10% | 19 |
| 1998 | 33% | 65 |
| 1999 | 37% | 72 |
| 2000 | 16% | 32 |
| 2001 | 5% | 9 |
| ADHERE enrollment | ||
| Not enrolled | 49% | 96 |
| Control | 26% | 52 |
| Intervention | 25% | 49 |
| Mean (SD) | min, max | |
| Doses of HAART/day | 5.0 (1.6) | 2, 10 |
| 3 day HAART adherence | 0.9 (0.2) | 0, 1 |
| Age | 40.8 (7.4) | 19.5,66.2 |
| Log10(RNA+1) | 2.0 (1.9) | 0, 5.7 |
Note: due to rounding some values may not sum to 100%
Summary of LME Model of Log10(RNA+1) (n = 618 observations derived from 197 subjects)
| Est (SE) | p-value | Multiple | |
| Intercept | 1.9 (.98) | .06 | |
| Time | .37 ( | ||
| Time0 | .58 (.48) | .23 | |
| Time6 | .69 (.48) | .15 | |
| Time12 | .84 (.48) | .08 | |
| Time18 | .88 (.50) | .08 | |
| Time24 | .58 (.49) | .24 | |
| Time30 | .18 (.49) | .72 | |
| Time36 | 0 | . | |
| Drink | 3.6 (.97) | .0003 | |
| Adherence | -.39 (.58) | .50 | |
| Time*Drink | .01 ( | ||
| Time0*Drink | -1.7 (.81) | .04 | |
| Time6*Drink | -2.2 (.81) | .006 | |
| Time12*Drink | -2.4 (.82) | .003 | |
| Time18*Drink | -1.9 (.82) | .02 | |
| Time24*Drink | -1.5 (.84) | .07 | |
| Time30*Drink | -.93 (.85) | .27 | |
| Time36*Drink | 0 | . | |
| Drink*Adherence | -1.6 (.67) | .02 | |
| Age | -.02 (.01) | .12 | |
| Female | .21 (.27) | .45 | |
| Homeless | .04 (.19) | .85 | |
| Doses/day | -.02 (.05) | .65 | |
| Enrollment year | .22 (.11) | .04 | |
| Race/ethnicity | .22 ( | ||
| Black | .21 (.23) | .37 | |
| Latino | .04 (.28) | .88 | |
| Other | 1.8 (.93) | .05 | |
| White | 0 | . | |
| ADHERE assignment | .27 ( | ||
| Non ADHERE | -.39 (.24) | .11 | |
| ADHERE treatment | -.16 (.25) | .17 | |
| ADHERE control | 0 | . | |
| Primary HIV risk factor | .25 ( | ||
| Men sex with men | .46 (.33) | .17 | |
| Injection drug use | .44 (.27) | .11 | |
| Heterosexual sex | 0 | . |
Figure 3log(RNA+1) predicted values over time from random intercept and slope model by adherence and abstinence status.
Summary of Linear Regression Model of Log10(RNA+1) at baseline (n = 197 observations)
| Est (SE) | p-value | Multiple | |
| Intercept | 1.0 (1.3) | .43 | |
| Drink | 2.6 (1.2) | .03 | |
| Adherence | -.37 (1.0) | .71 | |
| Drink*Adherence | -2.4 (1.2) | .05 | |
| Age | 0.0 (.02) | .98 | |
| Female | .58 (.37) | .12 | |
| Homeless | .44 (.32) | .18 | |
| Doses/day | -.07 (.09) | .43 | |
| Enrollment year | .32 (.14) | .02 | |
| Race/ethnicity | .94 ( | ||
| Black | .06 (.32) | .84 | |
| Latino | .06 (.36) | .88 | |
| Other | 1.2 (1.8) | .53 | |
| White | 0 | . | |
| ADHERE assignment | .36 ( | ||
| Non ADHERE | -.12 (.33) | .73 | |
| ADHERE treatment | .35 (.37) | .35 | |
| ADHERE control | 0 | . | |
| Primary HIV risk factor | .13 ( | ||
| Men sex with men | .83 (.7) | .08 | |
| Injection drug use | .69 (.37) | .06 | |
| Heterosexual sex | 0 | . |
Figure 4Histogram of random intercepts from random slope model (plus normal density).
Figure 5Histogram of random slopes from random slope model (plus normal density).
Figure 6Scatterplot of random intercepts and random slopes (9 subjects displayed in Figure 1 are indicated by their identification numbers).