Literature DB >> 30127613

Novel ensemble method for the prediction of response to fluvoxamine treatment of obsessive-compulsive disorder.

Hesam Hasanpour¹, Ramak Ghavamizadeh Meibodi¹, Keivan Navi¹, Sareh Asadi².

Abstract

OBJECTIVE: About 30% of obsessive-compulsive disorder (OCD) patients exhibit an inadequate response to pharmacotherapy. The detection of clinical variables associated with treatment response may result in achievement of remission in shorter period, preventing illness development and reducing socioeconomic costs.
METHODS: In total, 330 subjects with OCD diagnosis underwent 12-week pharmacotherapy with fluvoxamine (150-300 mg). Treatment response was ≥25% reduction in Yale-Brown Obsessive-Compulsive Scale (Y-BOCS) score. In total, 36 clinical attributes of 151 subjects who had completed their treatment course were analyzed. Data mining algorithms included missing value handling, feature selection, and new analytical method based on ensemble classification. The results were compared with those of other traditional classification algorithms such as decision tree, support vector machines, k-nearest neighbor, and random forest.
RESULTS: Sexual and contamination obsessions are high-ranked predictors of resistance to fluvoxamine pharmacotherapy as well as high Y-BOCS obsessive score. Our results showed that the proposed analysis strategy has good ability to distinguish responder and nonresponder patients according to their clinical features with 86% accuracy, 79% sensitivity, and 89% specificity.
CONCLUSION: This study proposed an analytical approach which is an accurate and a sensitive method for the analysis of high-dimensional medical data sets containing more number of missing values. The treatment of OCD could be improved by better understanding of the predictors of pharmacotherapy, which may lead to more effective treatment of patients with OCD.

Entities: Chemical

Keywords: attribute bagging; contamination; ensemble classification; fluvoxamine; obsessive–compulsive disorder; sexual obsession; treatment predictors

Year: 2018 PMID： 30127613 PMCID： PMC6091249 DOI： 10.2147/NDT.S173388

Source DB: PubMed Journal: Neuropsychiatr Dis Treat ISSN： 1176-6328 Impact factor: 2.570

Introduction

Obsessive–compulsive disorder (OCD) is a neuropsychiatric disorder and affects 1%–3% of the population worldwide1 and 1.8% (0.7% and 2.8% in men and women, respectively) in Iran.2 Pharmacotherapy and cognitive–behavioral therapy are considered effective for the treatment of this disorder.3 First-line drugs for OCD pharmacotherapy are serotonin reuptake inhibitors (SRIs),4 but 40%–60% of patients do not respond adequately to a trial of these drugs.5,6 The Yale-Brown Obsessive–Compulsive Scale (Y-BOCS) is frequently used to quantify the severity of obsessive–compulsive (OC) symptoms. Responders are clinically defined as patients who show >25% or 35% decline in Y-BOCS rating, although they may experience significant impairment from their residual OCD symptoms.4,7 Approximately one third of nonresponders to initial SRI monotherapy respond to a second, different SRI, but others are refractory patients who do not respond adequately to SRI pharmacotherapy.5 To find the most effective SRI, each medication has to be tried sequentially for at least 12 weeks.8 Using appropriate response predictors may result in the achievement of remission in shorter period, preventing illness development and reducing socioeconomic costs. There have been several attempts to detect predictors of treatment response with SRIs using demographic and clinical characteristics of OCD patients, their genotype, and the results of neuroimaging assessments.6,9–11 Studies investigated the possible association of OCD clinical characteristics, including symptom dimensions and SRI treatment, reported different predictors for treatment response. Factors that have been associated with poor response to OCD treatment include hoarding dimension,12–17 somatic obsessions,6 contamination and cleaning,15,18,19 repeating rituals and counting compulsions,13 obsessions of symmetry,15,17,20 poor insight,6,21,22 sexual/religious obsessions,23,24 severity of compulsions,19,25 early onset and chronic course of OC symptoms,6,19,26–30 psychiatric comorbidity,31–38 lack of sensory phenomena and greater symptom severity,35,39 SRI treatment at intake,37 absence of family history,6 family involvement in the OC symptoms,40,41 being male,30 being older at intake,30 and longer duration of illness.36,42 While greater OCD severity at intake was reported as the predictor of poor response to treatment,10,11,30,36,38,40,43,44 three studies found a better response in those with higher baseline severity of illness.45–47 Moreover, forbidden thoughts (sexual/religious/harm-related obsessions) with checking compulsions were associated with better acute medication response,15,20 but poor long-term outcome and treatment refractoriness were also reported.20,23,48 The other predictors of good response to SRI treatment were having a partner35,37,40 and washing and obsessive thoughts.49 The presence of numerous different kinds of variables that affect response to OCD treatment makes it difficult to detect appropriate predictors for treatment responder and nonresponder discrimination. Data mining provides an opportunity for the assessment of all potential predictors simultaneously. Machine learning methods not only consider the effect of each variable on the outcome of interest separately but also identify patterns of information that are useful to predict outcomes at the individual patient level.50 Machine learning is commonly used in the social and applied sciences, but limited attempts use this method in clinical research, especially psychiatry studies. However, some machine learning methods such as support vector machine (SVM), support vector regression, and random forest (RF) were used in OCD clinical research. Hoexter et al used machine learning methods to discriminate patients from healthy controls through brain structural magnetic resonance imaging and to predict OCD severity in patients.51 In another study, machine learning methods were used to predict remission in OCD.25 To our knowledge, feature selection and ensemble classification have not yet been applied to predict treatment response in OCD. In this study, we aimed at developing a classification algorithm based on an ensemble of classifiers to be used for the prediction of treatment response in order to help individualize clinical assignment of treatment. We assessed demographic and clinical variables of Iranian OCD patients to find the most important attributes and applied the proposed machine learning approach to predict OCD treatment response in fluvoxamine pharmacotherapy.

Methods

Subjects and treatment procedure

In total, 330 outpatients with Iranian origin meeting the text revision of the Diagnostic and Statistical Manual of Mental Disorders, 4th edition (DSM-IV-TR) criteria for OCD were recruited from Imam Hossain Hospital (Tehran, Iran) between 2014 and 2017. The diagnosis was established for the subjects by the administration of the structured clinical interview for DSM-IV-TR by psychiatrists, and the patients were checked for disease symptoms and severity using the Persian version of Y-BOCS checklist and severity scale.52 Inclusion criteria were the following: 1) meeting DSM-IV-TR criteria for OCD; 2) aged between 18 and 65 years; 3) having OCD symptoms for >1 year; and 4) a drug-free period of at least 3 weeks. Exclusion criteria were the following: 1) with a history of mental retardation; 2) having psychotic disorders; 3) having severe neurological pathology; 4) having other DSM-lV-TR Axis I disorders except depression, anxiety, or tic disorder; 5) with a history of substance use; 6) with a total Y-BOCS severity <9; 7) being under the selective serotonin reuptake inhibitor (SSRI) or antidepressant pharmacotherapy; 8) having prominent suicidal ideation; 9) pregnancy or lactation; and 10) refusal to participate or take treatment procedure. All the subjects completed a semi-structured interview that recorded clinical and sociodemographic data, such as age at assessment, gender, marital status, educational level, occupation, chief complaint, age of onset, and the familial history of any psychiatric disorders, specifically OCD. Pharmacotherapy was defined as 12-week treatment with fluvoxamine (150–300 mg). No concomitant therapy was allowed during the whole treatment period, either pharmacological or nonpharmacological. Fluvoxamine daily dose was initiated from 25 mg/day, increased to 50 mg/day after a week, and up to 100 mg/day for the third week. Its daily dose was increased up to 150 mg/day for the next 3 weeks, and after the sixth week, the patients were visited by the psychiatrist, and the fluvoxamine dose was adjusted based on the severity of patients’ symptoms. As ~60% of patients experience SRI pharmacotherapy that provided at least a 25%–35% decrease in OC severities on Y-BOCS,7 one of these cut points has typically been considered as the criterion for responsiveness. In an adequate trial of an SRI, a <25% decrease in the Y-BOCS score in patients with at least moderate symptom severity is usually considered nonresponse.4 We also chose 25% improvement cut point for fluvoxamine responsiveness. The severity of OCD symptoms at the first visit before treatment initialization and after 12-week treatment was scored using the Y-BOCS, by an experienced psychologist. According to the reduction in patients’ Y-BOCS scores at the beginning of the treatment compared with the scores after 12 weeks of treatment with fluvoxamine, they were divided into two groups: Group A (responders) was comprised of patients who exhibited >25% reduction in Y-BOCS scores after treatment with fluvoxamine, and group B (nonresponder) was comprised of patients who exhibited <25% reduction in Y-BOCS scores.5,53 We included another group (refractory patients) comprised of patients experienced various SSRI trials during their illness period, but the severity of their symptoms did not change or even became worse.4 Upon OCD diagnosis and meeting the inclusion criteria, 368 patients were invited to participate in the study, but some of them (19 patients) refused to participate, and 19 patients were excluded from the study due to total Y-BOCS severity score <9 (N=6), the history of substance use (N=3), the presence of psychotic disorders or severe neurological disorders (N=4), and other reasons (N=6). Of 330 patients who had participated in the study, 151 had completed it and were eligible for the analysis of response predictors. The remainders were excluded due to denial to come for follow-up (N=110), refusal to take medication (N=38), and other reasons (N=31). The CONSORT diagram in Figure 1 summarizes the flow of participants through different stages of the trial.

Figure 1

Consort diagram of the study.

Abbreviation: Y-BOCS, Yale-Brown Obsessive–Compulsive Scale.

Written informed consent was obtained from all subjects after a full description of the study. This study was performed in accordance with the World Medical Association’s Declaration of Helsinki and approved by the Neuroscience Research Center Ethical Committee (Project No IR.SBMU.PHNS.REC.1396.2).

Data processing

All data processing and analyses were conducted using MATLAB Version 2014a (The MathWorks Inc., Natick, MA, USA).

Variables

Response to treatment (responder, nonresponder, and refractory) was considered as a dependent variable, and sociodemographic and clinical variables were considered as independent variables (predictors). Thirty-six variables were studied as predictors such as gender, marital status, employment, educational level, family history, initial Y-BOCS obsession score, initial Y-BOCS compulsion score, Y-BOCS obsession subtypes, Y-BOCS compulsion subtypes, insight, avoidance, depression score, age of onset, and illness duration. Moreover, exploratory factor analysis was applied to the 13 Y-BOCS obsession and compulsion categories.54 Underlying factors were detected as factor 1: aggression, contamination, sexual, and religious obsessions as well as somatic, checking, and repeating compulsions; factor 2: symmetry obsession and cleaning, counting, and ordering compulsions; factor 3: contamination obsession and cleaning compulsion; and factor 4: hoarding obsession and compulsion. These factors were also considered as variables in our analysis.

Missing values

One of the important challenges in the analysis of medical data sets is the presence of missing values. Several methods have been proposed to treat missing data such as case deletion or replacing the missing attribute by the mean of known values of that attribute. Acuna and Rodriguez55 compared four different methods for treating missing values (case deletion, mean imputation, median imputation, and k-nearest neighbor [KNN] imputation) to evaluate the misclassification error rate of these methods. Their results showed that KNN imputation procedure performs better in dealing with missing values.55 In KNN imputation method, the missing values of an instance are imputed, considering a given number of instances that are most similar to the instance of interest. In the present study, we used KNN imputation procedure described in Box 1. Briefly, at first, we divided the samples into two groups, the samples with missing values and the samples without missing values. At the next step, for each instance of interest with missing values, we found the KNNs from the samples with complete data that have the same class of treatment response (responder, nonresponder, and refractory) and calculated the means of continuous variables or found the modes of nominal or ordinal features of these samples. Finally, we imputed these values to the missing values of the instance of interest. Thirty-one patients (21%) had attributes with missing values, and other patients’ data (79%) were complete. The proportion of missing data to all was 4%. The mechanism of missing data was missing at random which means that propensity for a data point to be missing is related to some of the observed data.

Feature selection

The presence of a lot of features affecting treatment response in psychiatric disorders and the small sample size emerge as common problems for the detection of treatment response predictors. Feature selection may improve accuracy and efficiency of classifier methods by finding the most appropriate features. Feature selection techniques can be classified into three groups: filter methods, wrapper methods, and embedded methods.56–58 In filter methods, features are selected based on the relevance of their intrinsic characteristics to the target classes using statistical tests such as Independent samples t-test or F-test.59 In wrapper methods, feature selection is “wrapped” around a learning method, and its importance is directly judged by the estimated accuracy of the learning method. Wrapper methods typically require extensive computation to search the best features, but the characteristics of the selected features match well with the learning method.60 Feature selection in embedded methods performs through the process of training and is usually specific to given learning machines. This kind of methods have the advantage of including the interaction with the classification model in feature selection procedure, while at the same time being far less computationally intensive than wrapper methods.58 According to different challenges in feature selection, there is not a specific algorithm that works best under all conditions.58 In the present study, we used maximum relevance minimum redundancy (MRMR, Equation 1) algorithm for feature selection, which is a classifier-independent method and needs less computational time.61,62 MRMR criteria have a good overall trade-off for accuracy/stability compared with other criteria.63 One of the advantages of MRMR criteria is detecting correlated features (which cannot be ruled out in simple filter methods) to exclude redundant ones from further analysis. The first part of the MRMR formula is maximum relevance, and the second part is called minimum redundancy. Equation 1 is an incremental method of MRMR implementation. According to this equation, we calculated mutual information of each feature (x) with the class label (c). The feature with maximum value was selected as the first feature. Next, the mean value of mutual information between each feature and selected features (S−1) was subtracted from the mutual information of each feature and class label. After finding the maximum, the second appropriate feature was found. This step was repeated several times until all the features were selected (in Equation 1, X denotes all features). In the present study, we used MRMR algorithm as a way for weighting variables and excluding redundant features. If the result of Equation 1 was <0, we excluded that feature from further analysis. In other words, if mutual information of a feature with the class label is less than the mean value of mutual information between feature and selected features, then that feature was removed. The weight of each feature has been calculated using the result of Equation 1 divided by total MRMR values of all features. To assess the relevance of features, we conducted chi-squared test for nominal attributes and independent sample t-test for continuous ones, and then, 1−(p value) was used as a merit of a feature.

Data analysis

Proposed algorithm

We used ensemble learning for the classification of our data set. In ensemble methods, multiple learning algorithms are used to achieve better predictive performance compared with each learning algorithm alone. The results showed that ensemble of classifiers can enhance class prediction even though individual classifiers might be rather weak and error-prone in making decisions.64,65 Successful applications of ensemble methods can be found in bioinformatics66 and medicine.67,68 Bagging,69 Boosting,70 and RF71 are famous ensemble-based algorithms. In ensemble classification, a number of base classifiers are trained. At the test time, test samples are given to all the base classifiers, and the class label of samples is determined, typically through majority vote, based on the output of all base classifiers.72 It is obvious that we should create diversity in the output of base classifiers; otherwise, the accuracy of the ensemble method does not change. For example, multilayer perceptron (MLP) with different structures or different initialization weights can be used as a base classifier. It is usually assumed that increasing diversity may decrease ensemble error.73 Theoretical and empirical results suggested that one of the most effective methods of achieving independence classifiers is attribute bagging (training the members of an ensemble on qualitatively different feature (sub)sets).74,75 It has been shown that the best voting accuracy is achieved for attribute subset sizes between one third and half of the total number of attributes.74 The proposed method has two phases: 1) sorting the features using MRMR algorithms and 2) weight assignment to each feature based on the importance of that feature. After sorting the features based on their weights, we removed attributes that have a weight less than a predefined threshold. Next, we generated several base classifiers (decision tree in our case) using a subset of features that were selected by roulette wheel sampling.76 With the roulette wheel algorithm, we selected attributes randomly according to their weights. At the time of base classifiers construction, we pruned base classifiers that have accuracy less than a predefined threshold (75% in our study). We split the data set into training and testing parts. The first part is used for training the model, and the second part is used for the validation of the model. At the test time, we used majority vote for determining the class label of samples. If base classifiers could not determine the class label accurately (eg, 45% of base classifiers agreed on class 1 and 55% agreed on class 2), we determined the class label of test sample based on the base classifier that has the best training accuracy. Box 2 describes the pseudocode for our proposed method. Abbreviation: MRMR, maximum relevance minimum redundancy.

Traditional classification algorithms and performance measures

We compared the results of our proposed method with several well-known learning models that are commonly used for classification including MLP, KNN, SVM, decision tree, and RF. MLP is a popular artificial neural network architecture with backpropagation (a supervised learning algorithm). It has been shown that given the right size and the structure, MLP is capable of learning arbitrarily complex nonlinear functions to arbitrary accuracy levels.77 KNN is based on the principle that the cases within a data set will generally exist in close proximity to other cases that have similar properties.78 The KNN algorithm locates the k-nearest instances to the query instance and determines its class by identifying the single most frequent class label. SVM is a maximum margin classification algorithm, which exploits information about the inner products in some feature space.79,80 Studies showed that these algorithms have a good ability for classification in medical data sets.81,82 Decision trees are powerful classification algorithms such as Quinlan’s ID3, C4.5, and C5 that are becoming increasingly more popular with the growth of data mining in the field of medicine.83 RF is an ensemble learner, a method that generates many classifiers and aggregates their results.71 RF shows high predictive accuracy and is applicable in high-dimensional data sets with highly correlated features, such as medical data sets.84,85 For evaluating the proposed analytical model, we used three performance measures: accuracy (Equation 2), sensitivity (Equation 3), and specificity (Equation 4), which are usually defined for binary classification. As we dealt with a multiclass application, we calculated these measures based on the following formulas:86 l is the number of class where TP, TN, FP, and FN denote true positives, true negatives, false positives, and false negatives for class i, respectively. True positive (TP) is the number of positive samples that are correctly identified as positive. False positive (FP) is the number of negative samples that are incorrectly identified as positive. False negative (FN) denotes the number of positive samples that are incorrectly identified as negative. True negative (TN) is the number of negative samples that are correctly identified as negative.

Results

Sociodemographic data of the studied sample

Table 1 summarizes sociodemographic and clinical variables of 151 patients who had completed pharmacotherapy. Responder, nonresponder, and refractory groups consisted of 68%, 50%, and 68% women; 68%, 64%, and 63% married patients; and 64%, 41%, and 59% unemployed patients (Table 1); 15% of patients in the responder group reported no history of mental illness in their family, but others (85%) were from the families with the history of psychiatric disorders. Percentages of positive family history in nonresponder and refractory groups were reported as 85% and 77%, respectively (Table 1).

Table 1

Clinical variables of patients in each treatment response classes

Variables	Responder	Nonresponder	Refractory	p value
Sex
Female	65	17	15	0.14
Male	30	17	7	0.14
Marital status
Single	30	12	8	0.86
Married	65	22	14	0.86
Occupation
Unemployed	61	14	13	0.06
Employed	34	20	9	0.06
Family history
Negative	14	5	5	0.63
Positive	81	29	17	0.63
Age at assessment
Mean	34.3	34.8	32.9	0.77
SD	10.6	9.9	8.5	0.77
Age of onset
Mean	24.0	20.9	20.6	0.18
SD	10.0	12.0	8.8	0.18
Illness duration
Mean	10.3	13.9	12.2	0.17
SD	9.8	8.7	11.9	0.17
Initial obsession score
Mean	10.3	12.4	12.0	0.04
SD	4.7	4.9	3.9	0.04
Initial compulsion score
Mean	9.3	9.4	8.6	0.85
SD	4.9	6.3	6.1	0.85
Initial total score
Mean	19.6	21.8	20.7	0.43
SD	8.2	10.0	8.7	0.43
Contamination obsession
Lack	29	4	10	0.01
Presence	66	30	12	0.01
Sexual obsession
Lack	72	16	19	0.001
Presence	23	18	3	0.001

Abbreviation: SD, standard deviation.

Patients’ ages of onset (mean±SD) were 24.0±10.0 in responder group, 20.9±12.0 in the nonresponder group, and 20.6±8.8 in the refractory group. The mean values of initial Y-BOCS scores for obsession (mean±SD) were 10.3±4.7 for responders, 12.4±4.9 for nonresponders, and 12.0±3.9 for refractory patients. One-way analysis of variance with Tukey’s post hoc test showed that obsession scores were significantly different between these three groups, and responders reported less severe obsessions compared with nonresponder and refractory patients. However, compulsion and total Y-BOCS scores were not significantly different between these three groups (Table 1). Analyses revealed that contamination and sexual obsessions were the most important predictors for fluvoxamine pharmacotherapy. Sixty-nine percent of responder patients reported contamination symptoms, whereas 88% of nonresponder patients had contamination obsessions. Sexual obsessions were also observed in 24% of responder patients, but in 52% of nonresponder patients (Table 1).

Treatment predictors

Of the 36 initial features, 19 were selected for further analysis as the result of feature selection and others were removed. Attributes such as sexual and contamination obsessions, obsession severity, and illness duration were selected to predict treatment response of patients, and attributes such as marital status, hoarding, and religious obsessions were removed from further analysis.

Model evaluation

Table 2 summarizes the average performance of different classification algorithms for the treatment response data set based on the results of 20 replications of 10-fold cross-validation. We used 10-fold stratified cross-validation for evaluating predictive models that is the best estimation technique, especially in data sets with small sample size.87,88 It must be noted that our data set contains 151 samples that belong to three classes; 95 patients responded adequately to treatment (responder class), 34 patients exhibited inadequate response to fluvoxamine (nonresponder class), and 22 patients belonged to refractory class. The values in the parentheses are the confidence interval of that measure at a 95% confidence level. The results showed that MLP is not a good algorithm for this data set, and decision tree is relatively better than other classification models. The results also showed that the new method which is proposed in this study is the best classifier for dealing with this data set. The accuracy, sensitivity, and specificity of the proposed method were 86%, 79%, and 89%, respectively.

Table 2

Accuracy, sensitivity, and specificity of different classification algorithms applied on the current OCD data set based on 20 repetitions of 10-fold cross-validation

Algorithms	Accuracy (%) (CI)	Recall (sensitivity) (%) (CI)	Specificity (%) (CI)
MLP	64 (56–71.2)	46 (38.2–53.9)	73 (65.4–79.4)
Decision tree	76 (68.6–82.1)	64 (56–71.2)	82 (75.1–87.3)
KNN	74 (66.4–80.3)	61 (53–68.4)	80 (72.9–85.6)
SVM	74 (66.4–80.3)	62 (54–69.3)	80 (72.9–85.6)
Random forest	75 (67.5–81.2)	62 (54–69.3)	81 (74–86.4)
New method	86 (79.5–90.6)	79 (71.8–84.7)	89 (83–93)

Abbreviations: CI, confidence interval; KNN, k-nearest neighbor; MLP, multilayer perceptron; OCD, obsessive–compulsive disorder; SVM, support vector machine.

Table 3 shows the values of TP, FP, FN, and TN for each treatment response class, which were resulted from applying conventional methods and new analysis method on our data set. The proposed analytical algorithm correctly assigned 83 subjects to responder class, 23 subjects to nonresponder class, and 13 subjects to refractory class. These tables show that some algorithms such as RF are very good at the prediction of responder class, but very poor on two other classes.

Table 3

TP, FP, FN, and TN values for each treatment response class resulted from some algorithms and new method

Algorithms	TP	FP	FN	TN
SVM
Responders	85	44	10	12
Nonresponders	7	8	27	109
Refractory	2	5	20	124
Random forest
Responders	89	47	6	9
Nonresponders	5	8	29	109
Refractory	1	1	21	128
Decision tree
Responders	72	24	23	32
Nonresponders	18	20	16	97
Refractory	7	10	15	119
Proposed method
Responders	83	13	12	43
Nonresponders	23	11	11	106
Refractory	13	8	9	121

Abbreviations: FN, false negative; FP, false positive; TN, true negative; TP, true positive; SVM, support vector machine.

Table 4 summarizes the accuracy, sensitivity, and specificity of the new method for each treatment response class. The accuracy values of the proposed method for prediction of classes 1–3 were 83%, 85%, and 89%, respectively. The sensitivity values were 87%, 68%, and 59%, respectively, for classes 1–3. Specificity measures were obtained as 77%, 90%, and 94%, respectively, for classes 1–3.

Table 4

Accuracy, sensitivity, and specificity of each of class resulted from the new method

Patients’ class	Accuracy (%)	Recall (sensitivity) (%)	Specificity (%)
Class 1 (responder)	83	87	77
Class 2 (nonresponder)	85	68	90
Class 3 (refractory)	89	59	94

Discussion

In the current study, we aimed at proposing a model suitable to predict the final outcome of OCD pharmacotherapy using fluvoxamine. From the results of our analysis, it appears that sexual and contamination obsessions and higher Y-BOCS obsessive scores are high-ranked predictors of resistance to fluvoxamine pharmacotherapy. Moreover, our proposed strategies for data analysis including missing value handling, feature selection using MRMR algorithm, and new analytical method based on ensemble classification are best suited for dealing with the OCD treatment response data set and prediction of fluvoxamine pharmacotherapy result. Data mining approaches can be used to process high-dimensional data sets such as medical data sets for the prediction of treatment response in order to help individualize clinical assignment of treatment. In the current study, we proposed a new ensemble classification that showed good accuracy, sensitivity, and specificity over some commonly used classification methods. This method appears to have a good potential for medical decision-making in the assignment of patients to treatment based on clinical characteristics. Medical data sets including those related to treatment response compromise numerous demographic, clinical, and genetic variables over a relatively small number of patients, which present challenges for analysis and data extraction. There are limited studies investigating the prediction of treatment response in OCD patients, and almost all of them dealt with these challenges. In this study, we used several strategies to reduce the number of variables and increase the ratio of samples to variables. First, we used MRMR algorithm for weighting the attributes. Second, we eliminated features with low association with treatment response classes. Finally, we applied attribute bagging for improving the accuracy and stability of classifier ensembles using random subsets of features. The results of the current study showed that predominance of contamination and sexual symptoms, as well as high scores in Y-BOCS obsessive subscores, are predictive of poor response to fluvoxamine. These findings are consistent with the previous reports that contamination and cleaning may represent markers of poor prognosis.15,18,19,25,38 Previous investigations revealed that different brain networks might be involved in OC symptoms such as checking, washing, symmetry, and hoarding.89–94 These findings suggest that poor treatment response of OCD patients with predominant contamination and sexual symptoms and higher obsession scores may be related to relatively distinct neural circuits correlated with these symptoms. The current investigation has a number of important strengths as well as some key limitations. This study is the first try to investigate predictors of OCD treatment response in an Iranian sample. Moreover, we proposed a new method of ensemble classification for treatment response prediction, which enables more comprehensive examination of potential predictors of remission. Missing value handling and feature selection improved analysis approach and make it possible to perform more accurate prediction. The other advantage of the current study was the administration of just one drug, fluvoxamine, which might reduce confounding factors. There are several shortcomings inherent in this study. Treatment response was evaluated after 12-week pharmaco-therapy, while the assessment of treatment response outcome in more extended period may detect stronger predictors of remission. Besides, some patients who reported drug side effects were excluded from the study or did not complete their medication. Altogether, a large number of patients were eliminated in different stages of the study, resulted in the small sample size of the study. Moreover, sampling was performed from a single medical center at Tehran leaded to the lack of diversity in our sample. Therefore, it may prevent our findings from being generalized to members of different cultural groups or other populations. Replication of the proposed method with independent and larger samples is suggested.

Conclusion

This study proposed an analytical approach that is an accurate and a sensitive method for the analysis of high-dimensional medical data sets containing more number of missing values. The treatment of OCD could be improved by better understanding of the predictors of pharmacotherapy, which may lead to more effective treatment of patients with OCD and more accurate prognostic information to them. Moreover, it helps to estimate how many patients will need access to alternative treatments, which is important from the public health perspective.

Partition data into two groups: data with complete attributes and data with missing values

For each sample with missing values

X=Find k-nearest neighbors of complete samples that belong to the same class

For each features of sample with missing values

If data type of feature is nominal or ordinal, then find the mode of the values for that feature (in X samples) and impute to missing feature

else find the mean of the values of that feature and impute to missing feature

Compute the weight of each feature based on MRMR algorithm and delete the redundant features

i=1, k=number_of_base classifiers (20 in our case)

Min=number_of_attributes/3;

Max=2*number_of_attributes/3;

While i≤k

t=generate a random number between Min and Max

New_attributes=select t attributes from all attributes using roulette wheel

Model=build a base classifier using New_attributes

If accuracy (model)>threshold (%75)

Base classifiers(i)= model

i=i+1

End if

End while

//test the model

For each of the test samples

If percent of base classifiers that agree on class label of test sample>threshold (%50)

Predict the class label of test sample using voting

Else

Predict the class label of test sample using best base classifier that obtained from previous section

End for

Abbreviation: MRMR, maximum relevance minimum redundancy.

66 in total

Review 1. A review of feature selection techniques in bioinformatics.

Authors: Yvan Saeys; Iñaki Inza; Pedro Larrañaga
Journal: Bioinformatics Date: 2007-08-24 Impact factor: 6.937

2. A randomized controlled trial of Japanese patients with obsessive-compulsive disorder--effectiveness of behavior therapy and fluvoxamine.

Authors: Eriko Nakatani; Akiko Nakagawa; Tomohiro Nakao; Chika Yoshizato; Maiko Nabeyama; Akiko Kudo; Kayoko Isomura; Naoko Kato; Kazuko Yoshioka; Midori Kawamoto
Journal: Psychother Psychosom Date: 2005 Impact factor: 17.659

3. Clinical predictors of drug response in obsessive-compulsive disorder.

Authors: S Erzegovesi; M C Cavallini; P Cavedini; G Diaferia; M Locatelli; L Bellodi
Journal: J Clin Psychopharmacol Date: 2001-10 Impact factor: 3.153

4. Long-term follow-up and predictors of clinical outcome in obsessive-compulsive patients treated with serotonin reuptake inhibitors and behavioral therapy.

Authors: P Alonso; J M Menchon; J Pifarre; D Mataix-Cols; L Torres; P Salgado; J Vallejo
Journal: J Clin Psychiatry Date: 2001-07 Impact factor: 4.384

5. Hoarding in obsessive compulsive disorder: results from a case-control study.

Authors: J Samuels; O Joseph Bienvenu; M A Riddle; B A M Cullen; M A Grados; K Y Liang; R Hoehn-Saric; G Nestadt
Journal: Behav Res Ther Date: 2002-05

6. Predictors of treatment response in obsessive-compulsive disorder: multivariate analyses from a multicenter trial of clomipramine.

Authors: D L Ackerman; S Greenland; A Bystritsky; H Morgenstern; R J Katz
Journal: J Clin Psychopharmacol Date: 1994-08 Impact factor: 3.153

10. Factor analysis of symptom subtypes of obsessive compulsive disorder and their relation to personality and tic disorders.

Authors: L Baer
Journal: J Clin Psychiatry Date: 1994-03 Impact factor: 4.384

2 in total

Review 1. Supervised Machine Learning: A Brief Primer.

Authors: Tammy Jiang; Jaimie L Gradus; Anthony J Rosellini
Journal: Behav Ther Date: 2020-05-16

2. Fluvoxamine treatment response prediction in obsessive-compulsive disorder: association rule mining approach.

Authors: Hesam Hasanpour; Ramak Ghavamizadeh Meibodi; Keivan Navi; Jamal Shams; Sareh Asadi; Abolhassan Ahmadiani
Journal: Neuropsychiatr Dis Treat Date: 2019-04-10 Impact factor: 2.570