Literature DB >> 30061782

Comparison of Efficiency GEE and QIF Methods for Predicting Factors Affecting on Bipolar I Disorder Under Complete-case in a Longitudinal Studies.

Zahra Geraili-Afra1, Alireza Abadi2, Jamshid Yazdani-Charati3, Somayeh Ahmadi Gooraji2, Mehran Zarghami4, Samaneh Saadat5.   

Abstract

BACKGROUND: Mood variation in manic and depression phases during time is common in type I of Bipolar disorder. Analyzing recurrence require to the related statistical methods. In this paper, we compare the two methods of estimating the GEE and the QIF in recurrence data.
METHODS: In this study, data of 255 patients with Bipolar I disorder hospitalized during years of 2007-2011. Recurrence in Bipolar I disorder was as outcome. Patients' characteristics were gender, age of onset, recurrence history in first degree family, and economic status. Under simulation, percentage of missing were generated to vary and handled by complete-case(cc) strategy. Data were analyzed using GEE and QIF methods. Performance of the methods was assessed using Relative Efficiency.
RESULTS: QIF method had more efficiency than GEE method in the data with missing /without missing. Odds of recurrence in a first-degree family history was 30% more than those without a family history (p=0.009). Also, odds of recurrence in high/moderate level of economic status was 23% more than low level status (p=0.014).
CONCLUSION: QIF method was more appropriated for modeling recurrence during time with the structure of more correlation and low dropout rate in data. Family history and economic status were more affected recurrence in type I of Bipolar disorder.

Entities:  

Keywords:  Bipolar I disorder; Generalized Estimation Equations; Quadratic Inference Function; Recurrence; missing; simulation

Year:  2018        PMID: 30061782      PMCID: PMC6029899          DOI: 10.5455/aim.2018.26.111-114

Source DB:  PubMed          Journal:  Acta Inform Med        ISSN: 0353-8109


INTRODUCTION

Bipolar disorder caused to frequent mood variation in manic and depression or both during time (1). The recurrence in Bipolar I disorder is prevalent in different types of mood disorders (1, 2). About 90% of these patients experience recurrence (3-5). The Bipolar I disorder affected the ability and quality of life, social and occupational status of patients with frequent recurrence (6). Its complications including aggression and anxiety, sleep disorder, sexual problems and suicide (1). Although, Bipolar I disorder is treated through drug therapy, psycho-education of patients and their family; most patients experience recurrence yet (7). Based on findings of previous studies related to recurrence in Bipolar disorder, factors such as reducing drug dosage, drug dependency and substance abuse, sleep pattern, life events and social-mental factors (low social protection, Social inconsistency and entertainments) can affect recurrence significantly (8-10). On the other hand, the severity of emotional symptoms in patients with Bipolar disorder is varying frequently. So, that is required to assess emotional status repeatedly by the longitudinal study (repeated measurements of recurrence in each patient during time) in order to reach appropriate estimations (1, 11). Some studies had evaluated recurrence in Bipolar disorder in Iran. In a study in 2011, a multivariate regression was used to predict a recurrence rate for six months (12); in another studies in 2007 and 2002, patients with Bipolar disorder were followed up one year (13, 14); in a study in 2006, patients with first episode mania of Bipolar disorder were followed up for 17 months (15). On the other hand, observations obtained from repeated measurements are correlated in a longitudinal study (11). There are methods that include the within subject correlation structure in data such as Marginal model. The Generalized Estimation Equations (GEE) and Quadratic Inference Function (QIF) as a consistent and unbiased estimation methods in a Marginal model are used for modeling the within subject correlation. Also, they are used for any outcome such as count, categorical and normal (16, 17). On the other hand, missing data are more common in longitudinal studies especially in clinical trials which may be due to lack of accurately record or lost to follow up of patients during time. Large proportion of missing data can affect results, and probably cause to a great bias in analysis. One of the methods for handling incomplete datasets or missing is “complete-case” (18). These estimation methods were applied in a few studies on Bipolar disorder, in which the efficiency of two methods were compared without considering missing in data (19-23). In the present study, we also aimed to comparing effect of missing on two estimation methods by simulation. Besides, GEE and QIF methods are compared under missing data which are handled by the “complete-case strategy.

METHODS

Patients and design

This longitudinal study was done by census at Zare hospital in the province of Mazandaran in the north of Iran; during five years follow up from 2007 to 2011. Total of 255 hospitalized patients with Bipolar I disorder aged from 13 to 55 years were included in study. Diagnosis of Bipolar disorder and biography of patients was done by psychiatrist. Patients had experienced at least one recurrence or anxiety or depression or both based on DSM-IV-TR criteria of American psychiatry association in year of 2000. Data collecting was done through patients’ medical records and interview. The outcome variable is recurrence in Bipolar I disorder (event or not (0, 1)) and the predictor variables are as follow: age of onset, gender, recurrence history in first degree family, economic status.

Statistical method

For descriptive analysis, mean ±standard deviation (SD) for quantitative variables, and frequency with percent for qualitative variables was used. All other analyses were done with R software by simulation method. Significance level was defined as less than 0.05. This section is organized as follows: First, the briefly introduction of GEE and QIF used to analysis Bipolar I disorder data. Second, the complete-case strategies used in this study for handling missing. Third, relative efficiency for compare between two methods are introduced.

Generalized Estimation Equations

This method, first introduced by Liang and Zeger in 1986, is the most widely used statistical inference in marginal models (24). The generalized estimation equations will fit by the Quasi-likelihood method and a working correlation construct that require less correlation (disturbance) parameters. In this equation, is considered as vector of correlated observations, and the covariance of quasi-likelihood function is replaced by a working covariance matrix, in which, the diagonal matrix of the variance of the marginal variables of the response, the working correlation matrix including the correlation parameters and the dispersion parameter α and ø. The estimation of regression coefficients β is obtained from the solution of the following equation. In this equation, it is assumed that the marginal mean is related to covariate by link function.

Method of Quadratic Inference Function

In 2000, Qu et al. used a generalized moment method for marginal regression analysis to improve the method of generalized estimation equations (25). In other words, this method is quasi-likelihood i.e. in generalized linear models, in which the inverse of the correlation matrix is approximated by the linear combination of unknown constant and known matrices. The matrix is identity matrix, are known basis matrices and, are constant coefficients and unknown. We use the generalized moment method to construct the optimal inference function. The matrix is a consistent estimate of the sample variance. The regression parameter is estimated without the need to estimate the nuisance parameter α: Under certain circumstances, this is unique. The inference function in is called the Quadratic Inference Function. Because the estimator of the Quadratic Inference Function does not estimate the correlation nuisance parameter, therefore there is no emphasis on the correct estimation of the correlation parameter (24-26).

Mechanism of complete - case (cc)

In this method, units with missing values are deleted and statistical analysis is done based on units with complete data. The advantage of this method is that it can be used for any kind of statistical analysis and does not require a particular computational method. In this method, the estimator is unbiased when missing data being as completely at random. Otherwise, estimator is biased and the accuracy and power of study reduce by losing a lot of information. This method is effective only if there is a few missing in data (18).

Relative efficiency (R.E):

Relative efficiency (R.E) rate is estimated as proportion of mean square error (MSE) of GEE estimation than MSE of QIF. If proportion rate is estimated more than one, that implied on the better relative efficiency of QIF than GEE.

RESULTS

Data of 255 patients with bipolar I disorder was analyzed. Based on Table 1, most patients were males (%55.3). Mean age range was between 13- 55 years, and mean age onset of disease was 24.11±8.61 year. Of total, only %43 of patients had experienced recurrence in the first-degree family. Most patients had high/moderate level of economical status (168 (65.9%)).
Table 1.

Patients’ characteristics (n=255)

N (%)CategoryVariables
141(55.3)maleGender
114(44.7)female
109 (43)YesHifistory in first-degree family
146(57)No
87(34.1)low decileEconomical status
168(65.9)(middle /upper) decile
The results of demographic variables by QIF method is presented in Table 2. Results showed that the recurrence history in a first-degree family, and economical status were statistically significant (p<0.05). Chance of recurrence in patients with recurrence history in a first-degree family was 30% more than those without family history. Besides, the chance of recurrence in high/moderate level of economical status was 27% more than low status (p<0.001).
Table 2.

Regression results of QIF method in marginal model for Bipolar I disorder data (missing / without missing data) * OR: Odds ratio; **It is significant at level of 0.05.

Parameter% missingEstimateS.EOR*P-value
Age of onset0-0.0070.00440.9930.114
10-0.0060.00480.9940.193
20-0.0010.00480.9990.891
Gender (Female)00.0210.07971.0210.793
100.0140.09091.0140.876
200.0620.09231.0640.504
History in first degree family (Yes)00.2660.10251.3050.009
100.3090.11261.3620.602
200.2930.12631.3400.205
Economic status ( high/moderate level)00.2090.08491.2330.014
100.2370.09411.2680.117
200.1610.09311.1750.838
Time00.0210.01171.0210.072
100.0120.01251.0120.337
200.0190.01381.0200.157
The results of demographic variables by GEE estimation are presented in Table 3. Results showed that the recurrence history in a first-degree family and economical status were statistically significant (p<0.05). With age of onset increasing, chance of recurrence was reduced about 0.01. Besides, chance of recurrence in patients with recurrence history in a first-degree family was 28% more than those without family history.
Table 3.

Regression results of GEE method in marginal model for Bipolar I disorder data. * OR: Odds ratio; **It is significant at level of 0.05.

Parameter % missingEstimate S.EOR*P-value
Age of onset0-0.0070.00440.9930.096
10-0.0060.00490.9940.194
20-0.0040.00490.9960.438
Gender (Female)00.0290.07971.030.708
100.0290.09091.0300.742
200.0770.09281.0810.403
History in first degree family (Yes)00.2520.10291.2860.014
100.2800.11341.3240.013
200.3040.12681.3550.016
Economic status ( high/moderate level)00.1840.08501.2020.030
100.2100.09491.2340.027
200.1400.09351.1510.133
Time00.0160.01281.0160.216
100.0130.01391.0130.342
200.0230.01451.0240.107
In continue, the relative efficiency of these two estimation methods were compared for choosing the best method. To do comparison, a simulation method was used considering different coefficient of correlation and dropout rate in data. According to relative efficiency (R.E) rates obtained by simulation in table 4, QIF had the more efficiency than GEE in total and also when data correlation was high (ρ=0.6) and rate of dropout percent was low.
Table 4.

Brief results of simulation for relative efficiency (R.E) under different rates of correlation (ρ) and missing. a: Is created by simulation

RE Bipolar dataREa (ρ=0.6)REa (ρ=0.1)% of missing
1.1421.0650.9840
1.1351.0890.98910
1.1301.0350.99320

DISCUSSION

Regarding the periodic changes of bipolar disorder between Manic and depression, assessing recurrence and emotional status of these patients during time is more important; and the cross-sectional study of recurrence meets to limitations (27). Therefore, the longitudinal study is more applicable, in which that is required to especial statistical methods for modeling correlation structure of data during time. In the present study, we used two estimation methods (GEE and QIF) for correlated data, in order to reach a best method for predicting recurrence and affected factors. In our study, simulation results showed that QIF had more efficiency to predict factors related to recurrence whether with/without missing data. This recent result is accordance with results of previous longitudinal studies conducted by simulation. In a simulation study with count data (24) without missing, QIF and GEE methods were compared considering two cut points of correlation as 0.3 and 0.7. Results of their study showed that when correlation structure in data is given appropriately, QIF and GEE have equally efficiency in range of 1.2-2.07. This finding was accordance with the results of studies by Song et al; Odueyungbo et al; and Kazemi et al (21-23). In our study, QIF had the more efficiency when data correlation was high (ρ=0.6) and rate of dropout percent was low. Based on results of our study which obtained by QIF method, factors such as recurrence history in a first-degree family and economical status had significantly affected recurrence. Recurrence in patients with family history was more than those without family history. Findings of previous studies had confirmed that genetic has an important effect on cause of disorder, and hypertension can also effective on recurrence in younger patients (1, 3, 27-29). Results of some studies had shown that the rate of disorder was more in males due to economical problems, and also patients with the lower economical level were more at risk of Bipolar disorder and depression (12, 14, 17). These findings are accordance with the results of our study. Based on reports of other studies, bipolar disorder is more observed in males due to their job stress, but gender had not significantly affected recurrence (1, 12, 14, 30). These finding was similar to results of present study. According to some studies which had used QIF method, chance of recurrence was more in a lower age of disease onset, which may be due to the role of genetic in made of disorder in childhood. This finding no confirmed by our study (31). So far, there has not been done a proper longitudinal study in related with the effective factors on type 1 bipolar disorder in our country. So, follow up of patients during time, an estimation of appropriate sample size and trying to achieve a valid statistical method for analyzing longitudinal and correlated data is the most important strong point of our study. Regarding the findings of our study were small step for identifying the predictive factors of this mental disorder, suggestions are proposed to enhance such studies such as controlling effect of drug use as confounding variable which affected recurrence over time. Besides, that is suggested conducting more study on comparing statistical methods for analyzing longitudinal data to achieve valid results.

CONCLUSION

QIF method was more appropriated for modeling recurrence during time with the structure of more correlation and low dropout rate in data. Factors such as family history and economic status were more affected recurrence in Bipolar 1 disorder
  15 in total

1.  Quadratic inference functions in marginal models for longitudinal data.

Authors:  Peter X-K Song; Zhichang Jiang; Eunjoo Park; Annie Qu
Journal:  Stat Med       Date:  2009-12-20       Impact factor: 2.373

2.  Goodness-of-fit tests for GEE modeling with binary responses.

Authors:  H X Barnhart; J M Williamson
Journal:  Biometrics       Date:  1998-06       Impact factor: 2.571

3.  Screening for bipolar disorder in a primary care practice.

Authors:  Amar K Das; Mark Olfson; Marc J Gameroff; Daniel J Pilowsky; Carlos Blanco; Adriana Feder; Raz Gross; Yuval Neria; Rafael Lantigua; Steven Shea; Myrna M Weissman
Journal:  JAMA       Date:  2005-02-23       Impact factor: 56.272

Review 4.  Predictors of relapse in bipolar disorder: A review.

Authors:  Susan Altman; Sophia Haeri; Lisa J Cohen; Aleksey Ten; Elysa Barron; Igor I Galynker; Katherine N Duhamel
Journal:  J Psychiatr Pract       Date:  2006-09       Impact factor: 1.325

5.  Screening for bipolar disorder in the community.

Authors:  Robert M A Hirschfeld; Joseph R Calabrese; Myrna M Weissman; Michael Reed; Marilyn A Davies; Mark A Frye; Paul E Keck; Lydia Lewis; Susan L McElroy; James P McNulty; Karen D Wagner
Journal:  J Clin Psychiatry       Date:  2003-01       Impact factor: 4.384

6.  Early course of bipolar disorder in high-risk offspring: prospective study.

Authors:  Anne Duffy; Martin Alda; Tomas Hajek; Paul Grof
Journal:  Br J Psychiatry       Date:  2009-11       Impact factor: 9.319

Review 7.  Statistical methods for longitudinal research on bipolar disorders.

Authors:  John Hennen
Journal:  Bipolar Disord       Date:  2003-06       Impact factor: 6.744

8.  Psychosocial predictors of major affective recurrences in bipolar disorder: a 4-year longitudinal study of patients on prophylactic treatment.

Authors:  G Stefos; F Bauwens; L Staner; D Pardoen; J Mendlewicz
Journal:  Acta Psychiatr Scand       Date:  1996-06       Impact factor: 6.392

9.  Antidepressant dosage taken by patients with bipolar disorder: factors associated with irregularity.

Authors:  Rita Bauer; Tasha Glenn; Martin Alda; Kemal Sagduyu; Wendy Marsh; Paul Grof; Rodrigo Munoz; Greg Murray; Philipp Ritter; Ute Lewitzka; Emanuel Severus; Peter C Whybrow; Michael Bauer
Journal:  Int J Bipolar Disord       Date:  2013-12-09

10.  Comparison of generalized estimating equations and quadratic inference functions using data from the National Longitudinal Survey of Children and Youth (NLSCY) database.

Authors:  Adefowope Odueyungbo; Dillon Browne; Noori Akhtar-Danesh; Lehana Thabane
Journal:  BMC Med Res Methodol       Date:  2008-05-09       Impact factor: 4.615

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.