| Literature DB >> 33028929 |
Ashenafi A Yirga1, Sileshi F Melesse2, Henry G Mwambi2, Dawit G Ayele3.
Abstract
It is of great interest for a biomedical analyst or an investigator to correctly model the CD4 cell count or disease biomarkers of a patient in the presence of covariates or factors determining the disease progression over time. The Poisson mixed-effects models (PMM) can be an appropriate choice for repeated count data. However, this model is not realistic because of the restriction that the mean and variance are equal. Therefore, the PMM is replaced by the negative binomial mixed-effects model (NBMM). The later model effectively manages the over-dispersion of the longitudinal data. We evaluate and compare the proposed models and their application to the number of CD4 cells of HIV-Infected patients recruited in the CAPRISA 002 Acute Infection Study. The results display that the NBMM has appropriate properties and outperforms the PMM in terms of handling over-dispersion of the data. Multiple imputation techniques are also used to handle missing values in the dataset to get valid inferences for parameter estimates. In addition, the results imply that the effect of baseline BMI, HAART initiation, baseline viral load, and the number of sexual partners were significantly associated with the patient's CD4 count in both fitted models. Comparison, discussion, and conclusion of the results of the fitted models complete the study.Entities:
Mesh:
Year: 2020 PMID: 33028929 PMCID: PMC7541535 DOI: 10.1038/s41598-020-73883-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Distribution of CD4 count and associated selected covariates with percent missing.
| Covariates | Level | CD4 count N (%) | p-value | % Missing | ||
|---|---|---|---|---|---|---|
| < 200 | 200–500 | > 500 | ||||
| Baseline BMI category | Underweight | 2 (0.03) | 219 (3.12) | 254 (3.62) | < 0.0001 | 0.0 |
| Normal weight | 114 (1.62) | 2305 (32.84) | 2690 (38.32) | |||
| Overweight | 18 (0.26) | 512 (7.29) | 657 (9.36) | |||
| Obese | 0 | 17 (0.24) | 231 (3.29) | |||
| Baseline viral load | Undetected | 0 | 0 | 16 (0.23) | < 0.0001 | 0.0 |
| Low | 20 (0.28) | 791 (11.27) | 1532 (21.83) | |||
| Medium | 45 (0.64) | 1209 (17.22) | 1497 (21.23) | |||
| High | 69 (0.98) | 1053 (15) | 787 (11.21) | |||
| Number of sexual partners | No partner | 29 (0.41) | 565 (8.05) | 579 (8.25) | < 0.0001 | 0.0 |
| Stable partner | 85 (1.21) | 2274 (32.4) | 3078 (43.85) | |||
| Many partners | 20 (0.28) | 214 (3.05) | 175 (2.49) | |||
| Age group | < 20 | 1 (0.01) | 130 (1.82) | 121 (1.72) | < 0.0001 | 0.0 |
| 20–29 | 97 (1.38) | 1872 (26.67) | 1977 (28.17) | |||
| 30–39 | 17 (0.24) | 813 (11.58) | 1255 (17.88) | |||
| 40–49 | 19 (0.27) | 203 (2.89) | 369 (5.26) | |||
| 50–59 | 0 | 35 (0.5) | 91 (1.3) | |||
| ≥ 60 | 0 | 0 | 19 (0.27) | |||
| Educational level | Primary school | 3 (0.04) | 104 (1.48) | 181 (2.58) | 0.0129 | 0.0 |
| Secondary school | 131 (1.87) | 2949 (42.01) | 3651 (52.02) | |||
| Place of residence | Rural | 62 (0.88) | 1467 (20.90) | 1806 (25.73) | 0.7176 | 0.06 |
| Urban | 72 (1.03) | 1586 (22.6) | 2026 (28.86) | |||
| ART initiation group | Pre ART | 110 (1.57) | 2566 (36.56) | 2783 (39.65) | < 0.0001 | 0.0 |
| Post ART | 20 (24) | 487 (6.94) | 1049 (14.95) | |||
The response variable (CD cell count) has 110 (1.5%) missing observations.
Figure 1Individual profiles plot of CD4 cell count for 17 randomly selected subjects.
Comparisons of fit statistics for the two distributions.
| Distribution | Fit statistics | |||||
|---|---|---|---|---|---|---|
| − 2 log likelihood | AIC | AICC | BIC | CAIC | HQIC | |
| Poisson | 204,842.9 | 204,892.9 | 204,893.1 | 204,979.4 | 205,004.4 | 204,927.8 |
| NB | 87,781.28 | 87,833.28 | 87,833.48 | 87,923.23 | 87,949.23 | 87,869.54 |
Measure of over-dispersion between Poisson and negative binomial distribution.
| Fit Statistics for Conditional Distribution | Poisson | NB |
|---|---|---|
| − 2 log L(CD4 counts/r. effects) | 199,670.3 | 85,320.39 |
| Pearson | 145,017.0 | 6396.89 |
| Pearson | 20.66 |
Figure 2Data-scale raw residuals and Model-scale studentized residuals versus predicted values.
Comparison of random effect models.
| Random effect models | Information criteria | |||||
|---|---|---|---|---|---|---|
| − 2log | AIC | AICC | BIC | CAIC | HQIC | |
| Model 1 | 87,781.28 | 87,833.28 | 87,833.48 | 87,923.23 | 87,949.23 | 87,869.54 |
| Model 2 | 88,603.50 | 88,649.50 | 88,649.66 | 88,729.07 | 88,752.07 | 88,681.58 |
| Model 3 | 88,591.64 | 88,637.64 | 88,637.80 | 88,717.21 | 88,740.21 | 88,669.72 |
| Model 4 | 89,156.39 | 89,202.39 | 89,202.55 | 89,281.96 | 89,304.96 | 89,234.47 |
| Model 5 | 89,837.18 | 89,879.18 | 89,879.31 | 89,951.83 | 89,972.83 | 89,908.47 |
| Model 6 | 92,302.08 | 92,344.08 | 92,344.21 | 92,416.73 | 92,437.73 | 92,373.37 |
| Model 7 | 91,190.61 | 91,232.61 | 91,232.74 | 91,305.26 | 91,326.26 | 91,261.90 |
Type III Analysis of fixed effects for Poisson and NB distribution.
| Effect | Num DF | Den DF | NB | Poisson | ||
|---|---|---|---|---|---|---|
| F value | Pr > F | F value | Pr > F | |||
| Time in month | 1 | 235 | 62.53 | < 0.0001 | 14.80 | 0.0002 |
| Sqrt_Time | 1 | 234 | 86.36 | < 0.0001 | 48.41 | < 0.0001 |
| Baseline BMI category | 3 | 6307 | 6.26 | 0.0003 | 6.31 | 0.0003 |
| ART initiation | 1 | 6307 | 345.45 | < 0.0001 | 5890.28 | < 0.0001 |
| Baseline VL | 3 | 6307 | 7.48 | < 0.0001 | 12.79 | < 0.0001 |
| No. of sexual partners | 2 | 6307 | 1.64 | 0.1935 | 1.85 | 0.1578 |
| Age group | 5 | 6307 | 1.46 | 0.1987 | 27.34 | < 0.0001 |
| Education level | 1 | 6307 | 0.25 | 0.6196 | 0.15 | 0.6990 |
| Place of residence | 1 | 6307 | 0.01 | 0.9246 | 0.11 | 0.7406 |
Parameter estimates using Poisson and NB mixed-effects model.
| Covariates | Negative binomial mixed-effects model | Poisson mixed-effects model | |||||
|---|---|---|---|---|---|---|---|
| Estimate | SE | Pr >|t| | 95% CI for NB estimate | Estimate | SE | Pr >|t| | |
| Intercept | 6.4697 | 0.04982 | < 0.0001 | (6.3715, 6.5679) | 6.4625 | 0.04264 | < 0.0001 |
| Time in month | 0.007824 | 0.000989 | < 0.0001 | (0.005875, 0.009774) | 0.006564 | 0.001706 | 0.0002 |
| Sqrt_Time | − 0.08649 | 0.009307 | < 0.0001 | (− 0.1048, − 0.06815) | − 0.06839 | 0.009830 | < 0.0001 |
| ART initiation (post) | 0.2301 | 0.01238 | < 0.0001 | (0.2058, 0.2543) | 0.1947 | 0.002537 | < 0.0001 |
| Obese | 0.4815 | 0.1113 | < 0.0001 | (0.2633, 0.6996) | 0.4985 | 0.1147 | < 0.0001 |
| Overweight | 0.02561 | 0.04975 | 0.6067 | (− 0.07191, 0.1231) | 0.03131 | 0.05148 | 0.5431 |
| Underweight | 0.005901 | 0.07927 | 0.9407 | (− 0.1495, 0.1613) | 0.01691 | 0.08264 | 0.8379 |
| High VL | − 0.2393 | 0.05157 | < 0.0001 | (− 0.3404, − 0.1382) | − 0.3074 | 0.05065 | < 0.0001 |
| Medium VL | − 0.1258 | 0.04587 | 0.0061 | (− 0.2157, − 0.03585) | − 0.1121 | 0.04686 | 0.0168 |
| Undetectable | 0.1377 | 0.2901 | 0.6351 | (− 0.4310, 0.7064) | 0.1199 | 0.2978 | 0.6872 |
| Many partners | − 0.1560 | 0.09394 | 0.0967 | (− 0.3402, 0.02811) | − 0.1674 | 0.09908 | 0.0911 |
| No partner | − 0.04821 | 0.04993 | 0.3343 | (− 0.1461, 0.04967) | − 0.05913 | 0.05164 | 0.2522 |
| 20–29 | 0.01166 | 0.03104 | 0.7072 | (− 0.04919, 0.07251) | − 0.00791 | 0.007830 | 0.3125 |
| 30–39 | 0.02852 | 0.03432 | 0.4060 | (− 0.03876, 0.09580) | − 0.01239 | 0.008474 | 0.1438 |
| 40–49 | − 0.00719 | 0.04545 | 0.8743 | (− 0.09629, 0.08191) | − 0.03422 | 0.01112 | 0.0021 |
| 50–59 | − 0.05694 | 0.06662 | 0.3927 | (− 0.1875, 0.07365) | − 0.1399 | 0.01549 | < 0.0001 |
| ≥ 60 | 0.2082 | 0.1532 | 0.1741 | (− 0.09205, 0.5084) | − 0.3107 | 0.03519 | < 0.0001 |
| Primary school | − 0.04509 | 0.09084 | 0.6196 | (− 0.2232, 0.1330) | − 0.03582 | 0.09263 | 0.6990 |
| Rural | − 0.00373 | 0.03947 | 0.9246 | (− 0.08112, 0.07365) | 0.01337 | 0.04038 | 0.7406 |
Figure 3Prediction of 7 randomly selected individual profiles plot of CD4 count for 4 years.
Combined results of a negative binomial mixed-effects model analysis using MI Procedure to deal with the missing values.
| Parameter | Parameter estimates (10 imputations) | |||||
|---|---|---|---|---|---|---|
| Estimate | SE | Pr >|t| | 95% confidence limits | Minimum | Maximum | |
| Intercept | 6.459413 | 0.049830 | < 0.0001 | (6.36175, 6.55708) | 6.458658 | 6.460775 |
| Time in month | 0.007475 | 0.000975 | < 0.0001 | (0.00556, 0.00939) | 0.007450 | 0.007508 |
| Sqrt_Time | − 0.083647 | 0.009266 | < 0.0001 | (− 0.10181, − 0.06549) | − 0.083982 | − 0.083434 |
| ART initiation (Post) | 0.224037 | 0.012594 | < 0.0001 | (0.19935, 0.24872) | 0.223216 | 0.225014 |
| Obese | 0.474714 | 0.109902 | < 0.0001 | (0.25931, 0.69012) | 0.473892 | 0.475630 |
| Overweight | 0.024208 | 0.048971 | 0.6211 | (− 0.07177, 0.12019) | 0.023820 | 0.024529 |
| Underweight | 0.002070 | 0.078101 | 0.9789 | (− 0.15101, 0.15515) | 0.001321 | 0.003137 |
| High VL | − 0.239102 | 0.051294 | < 0.0001 | (− 0.33964, − 0.13857) | − 0.239735 | − 0.238839 |
| Medium VL | − 0.122078 | 0.045390 | 0.0072 | (− 0.21104, − 0.03311) | − 0.122251 | − 0.121642 |
| Undetectable | 0.142848 | 0.286259 | 0.6178 | (− 0.41821, 0.70391) | 0.142510 | 0.143351 |
| Many partners | − 0.153632 | 0.092090 | 0.0953 | (− 0.33412, 0.02686) | − 0.154667 | − 0.152911 |
| No partner | − 0.046962 | 0.049227 | 0.3401 | (− 0.14344, 0.04952) | − 0.047267 | − 0.046691 |
| 20–29 | 0.013477 | 0.031659 | 0.6703 | (− 0.04857, 0.07553) | 0.012306 | 0.014325 |
| 30–39 | 0.033725 | 0.034974 | 0.3349 | (− 0.03482, 0.10227) | 0.032678 | 0.034744 |
| 40–49 | − 0.005842 | 0.046177 | 0.8993 | (− 0.09635, 0.08466) | − 0.007790 | − 0.004745 |
| 50–59 | − 0.052070 | 0.067501 | 0.4405 | (− 0.18437, 0.08023) | − 0.054207 | − 0.051024 |
| ≥ 60 | 0.206708 | 0.156046 | 0.1853 | (− 0.09914, 0.51255) | 0.205360 | 0.207553 |
| Primary school | − 0.046292 | 0.089605 | 0.6054 | (− 0.22191, 0.12933) | − 0.046602 | − 0.046009 |
| Rural | − 0.001916 | 0.038813 | 0.9606 | (− 0.07799, 0.07416) | − 0.002146 | − 0.001596 |