| Literature DB >> 27783715 |
Jorge Barros1, Susana Morales1, Orietta Echávarri1, Arnol García2, Jaime Ortega2, Takeshi Asahi2, Claudia Moya3, Ronit Fischman4, María P Maino1, Catalina Núñez5.
Abstract
OBJECTIVE: : To analyze suicidal behavior and build a predictive model for suicide risk using data mining (DM) analysis.Entities:
Mesh:
Year: 2016 PMID: 27783715 PMCID: PMC7112738 DOI: 10.1590/1516-4446-2015-1877
Source DB: PubMed Journal: Braz J Psychiatry ISSN: 1516-4446 Impact factor: 2.697
Sociodemographic characteristics of the sample and differences between groups
| Variable | Total (n=707) | No current suicidal behavior (n=358) | Current suicidal behavior (n=349) | Test |
|---|---|---|---|---|
| Age (years) | ||||
| Mean (SD) | 39.7 (14.9) | 42.2 (14.5) | 37.2 (14.8) | t = -4.4993, df = 704, p < 0.001 |
| Sex | ||||
| Female | 564 (79.8) | 287 (80.2) | 277 (79.4) | χ2 = 0.029053, df = 1, p = 0.864 |
| Male | 143 (20.2) | 71 (19.8) | 72 (20.6) | |
| Marital status | ||||
| Married | 259 (36.6) | 148 (41.3) | 110 (31.5) | χ2 = 13.12, df = 3, p < 0.05 |
| Unmarried | 33 (4.7) | 19 (5.3) | 13 (3.7) | |
| Single | 295 (41.7) | 127 (35.5) | 169 (48.4) | |
| Divorced or widowed | 120 (17.0) | 64 (17.9) | 57 (16.3) | |
| Parental status | ||||
| Has children | 454 (64.2) | 248 (69.3) | 206 (59.0) | χ2 = 8.0851, df = 1,p < 0.05 |
| No children | 253 (35.8) | 110 (30.7) | 143 (41.0) | |
| Highest educational attainment | ||||
| Higher education | 333 (47.1) | 154 (43.0) | 179 (51.3) | χ2 = 4.0694, df = 1,p < 0.05 |
| No higher education | 374 (52.9) | 204 (57.0) | 170 (48.7) | |
| Occupation | ||||
| Employed | 375 (53.0) | 221 (61.7) | 154 (44.1) | χ2 = 25.91, df = 3, p < 0.001 |
| Student | 157 (22.2) | 56 (15.6) | 101 (28.9) | |
| Unemployed | 42 (5.9) | 20 (5.6) | 22 (6.3) | |
| Homemaker | 133 (18.8) | 61 (17.0) | 72 (20.6) |
Data presented as n (%), unless otherwise specified.
df = degrees of freedom; SD = standard deviation.
Distribution of mood disorders and differences between groups
| Variable | Total | No current suicidal behavior | Current suicidal behavior | Test |
|---|---|---|---|---|
| Major depressive disorder | 311 | 106 (34.08) | 205 (65.93) | χ2 = 67.75df = 8 p < 0.001 |
| Bipolar disorder | 112 | 62 (55.36) | 50 (44.64) | |
| Moderate depressive disorder | 53 | 30 (56.60) | 23 (43.40) | |
| Mild depressive disorder | 13 | 12 (92.31) | 1 (6.69) | |
| Anxiety disorder | 74 | 52 (70.27) | 22 (29.73) | |
| Mixed episode | 14 | 12 (85.71) | 2 (14.29) | |
| Adjustment disorder | 73 | 45 (63.01) | 27 (36.99) | |
| Dysthymia | 8 | 5 (62.50) | 3 (37.50) | |
| Other disorders | 29 | 15 (51.72) | 14 (48.28) | |
| Total | 687 | 340 | 347 | |
| N/A (missing values) | 20 | 18 | 2 |
Data presented as n (%).
df = degrees of freedom.
Missing values were excluded from the statistical analysis.
Age distribution and differences between groups
| Age (years) | Total | No current suicidal behavior | Current suicidal behavior | Test |
|---|---|---|---|---|
| 14-19 | 80 | 25 (31.25) | 55 (68.75) | χ2 = 28.82df = 5 p < 0.001 |
| 20-29 | 130 | 57 (43.85) | 73 (56.15) | |
| 30-39 | 135 | 66 (48.89) | 69 (51.11) | |
| 40-49 | 142 | 85 (59.86) | 57 (40.14) | |
| 50-59 | 156 | 81 (51.92) | 75 (48.08) | |
| > 60 | 63 | 44 (69.84) | 19 (30.16) | |
| Total | 706 | 358 | 348 | |
| N/A (missing values) | 1 | 0 | 1 |
Data presented as n (%).
df = degrees of freedom.
Missing values were excluded from the statistical analysis.
Figure 1Support vector machine (SVM) model fit with 10, 20, and 30 variables.
Parameter adjustments for each of the proposed models
| Model | No. folds | No. iterations | No. variables | Parameters for adjustment | Optimal parameters |
|---|---|---|---|---|---|
| CART | 10 | 1 | 3 | cp = complexity parameter | cp = 0.0216763 |
| KNN | 10 | 10 | 22 | k = number of neighbors | k = 17 |
| SVM | 10 | 5 | 22 | cost = regularization parameter | cost = 1 |
| sigma = radial basis function kernel parameter | sigma = 0.0625 | ||||
| Random Forest | 10 | 1 | 32 | mtry = subset of variables used in each tree | mtry = 11 |
| AdaBoost | 10 | 3 | 32 | ntree = number of trees | ntree = 33 |
| maxdepth = maximum depth of each tree | maxdepth = 4 | ||||
| alpha = type of coefficient for updating weightings | alpha = Breiman |
CART = Classification and Regression Tree; KNN = k-nearest neighbor; SVM = support vector machine.
Results of validation of the five models generated
| CART | SVM | KNN | AdaBoost | Random forest | |
|---|---|---|---|---|---|
| Accuracy | 0.72 | 0.78 | 0.73 | 0.76 | 0.78 |
| Sensitivity | 0.71 | 0.77 | 0.74 | 0.75 | 0.78 |
| Specificity | 0.74 | 0.79 | 0.73 | 0.76 | 0.77 |
CART = Classification and Regression Tree; KNN = k-nearest neighbor; SVM = support vector machine.
Figure 2Boxplot of accuracy achieved by the five models generated CART = Classification and Regression Tree; KNN = k-nearest neighbor; SVM = support vector machine.
Figure 3ROC space models. Average information from K-fold cross-validation. CART = Classification and Regression Tree; KNN = k-nearest neighbor; ROC = receiver operating characteristics; SVM = support vector machine.
Figure 4Histogram of support vector machine (SVM) model accuracy.
Variables included in the predictive model of suicide risk
| Measure | Variable | Question in the measure |
|---|---|---|
| OQ | OQPRE8_SD_n: question 8 of the OQ normalized to a range between [0,1] | I think about taking my life. |
| RFL | RFL19_20_24_SUPAF_n: average of questions 19, 20, and 24 of the RFL measure normalized to a range between [0,1] | I care about myself enough to live.Life is too beautiful and precious to bring to an end.I have a love for life. |
| RFL | RFL25_SUPAF_n: question 25 of the RFL measure normalized to a range between [0,1] | I am too stable to kill myself. |
| RFL | RFL12_SUPAF_n: question 12 of the RFL measure normalized to a range between [0,1] | Life is all we have and is better than nothing. |
| OQ | OQPRE13_SD_n: question 13 of the OQ normalized to a range between [0,1] | I am a happy person. |
| RFL | RFL5_OBMOR_n: question 5 of the RFL measure normalized to a range between [0,1] | I believe only God has the right to end a life. |
| OQ | OQPRE31_SD_n: question 31 of the OQ normalized to a range between [0,1] | I am satisfied with my life. |
| RFL | RFL10_SUPAF_n: question 10 of the RFL measure normalized to a range between [0,1] | I do not want to die. |
| Diagnosis | T_E_DEPRE_MOD_o_SEV: indicates if the diagnosis is of a moderate or severe kind of depressive disorder/event | Presents moderate depressive episode, severe depressive episode, major depressive disorder. |
| OQ | OQPRE24_SD_n: question 24 of the OQ normalized to a range between [0,1] | I am happy with myself. |
| Sociodemographic variable | TIENE_1_HIJO | Indicates if they have exactly one child. |
| RFL | RFL45_SUPAF_n: question 45 of the RFL measure normalized to a range between [0,1] | I see no reason to hurry death along. |
| RFL | RFL17_SUPAF_n: question 17 of the RFL measure normalized to a range between [0,1] | I want to experience all that life has to offer and there are many experiences I haven’t had yet that I want to have. |
| RFL | RFL22_SUPAF_n: question 22 of the RFL measure normalized to a range between [0,1] | I believe I can find other solutions to my problems. |
| DEQ | DEQPRE62_n: question 62 of the DEQ normalized to a range between [0,1] | I am very satisfied (a) with myself (b) withwhat I have achieved. |
| RFL | RFL50_n: question 50 of the RFL measure normalized to a range between [0,1] | The thought of suicide is totally incomprehensible to me. |
| OQ | OQPRE3_SD_n: question 3 of the OQ normalized to a range between [0,1] | Nothing interests me. |
| RFL | RFL2_n: question 2 of the RFL measure normalized to a range between [0,1] | I believe I can learn to adjust or cope with my problems. |
| RFL | RFL40_n: question 40 of the RFL measure normalized to a range between [0,1] | I have hopes that things will improve and the future will be happier. |
| RFL | RFL14_n: question 14 of the RFL measure normalized to a range between [0,1] | No matter how badly I feel, I know that it will not last. |
Accuracy = 0.779, sensitivity = 0.770, specificity = 0.790.
DEQ = Depressive Experience Questionnaire; OQ = Outcome Questionnaire; RFL = Reasons for Living.