| Literature DB >> 29937740 |
Merylin Monaro1, Andrea Toncini1, Stefano Ferracuti2, Gianmarco Tessari2, Maria G Vaccaro3, Pasquale De Fazio4, Giorgio Pigato5, Tiziano Meneghel6, Cristina Scarpazza1, Giuseppe Sartori1.
Abstract
Major depression is a high-prevalence mental disease with major socio-economic impact, for both the direct and the indirect costs. Major depression symptoms can be faked or exaggerated in order to obtain economic compensation from insurance companies. Critically, depression is potentially easily malingered, as the symptoms that characterize this psychiatric disorder are not difficult to emulate. Although some tools to assess malingering of psychiatric conditions are already available, they are principally based on self-reporting and are thus easily faked. In this paper, we propose a new method to automatically detect the simulation of depression, which is based on the analysis of mouse movements while the patient is engaged in a double-choice computerized task, responding to simple and complex questions about depressive symptoms. This tool clearly has a key advantage over the other tools: the kinematic movement is not consciously controllable by the subjects, and thus it is almost impossible to deceive. Two groups of subjects were recruited for the study. The first one, which was used to train different machine-learning algorithms, comprises 60 subjects (20 depressed patients and 40 healthy volunteers); the second one, which was used to test the machine-learning models, comprises 27 subjects (9 depressed patients and 18 healthy volunteers). In both groups, the healthy volunteers were randomly assigned to the liars and truth-tellers group. Machine-learning models were trained on mouse dynamics features, which were collected during the subject response, and on the number of symptoms reported by participants. Statistical results demonstrated that individuals that malingered depression reported a higher number of depressive and non-depressive symptoms than depressed participants, whereas individuals suffering from depression took more time to perform the mouse-based tasks compared to both truth-tellers and liars. Machine-learning models reached a classification accuracy up to 96% in distinguishing liars from depressed patients and truth-tellers. Despite this, the data are not conclusive, as the accuracy of the algorithm has not been compared with the accuracy of the clinicians; this study presents a possible useful method that is worth further investigation.Entities:
Keywords: automatic; decetpion; depression; machine learning; malingering
Year: 2018 PMID: 29937740 PMCID: PMC6002526 DOI: 10.3389/fpsyt.2018.00249
Source DB: PubMed Journal: Front Psychiatry ISSN: 1664-0640 Impact factor: 4.157
Figure 1The figure reports an example of the computer screen as appeared to the subjects during the task.
The table reports the description of the space-time features recorded by MouseTracker software.
| Temporal features | Initiation time (IT) | Time between the appearance of the question and the beginning of the mouse movement |
| Reaction time (RT) | Time from the appearance of the question to the click on the response box | |
| Maximum deviation time (MD-time) | Time to reach the point of maximum deviation | |
| Spatial features | Maximum deviation (MD) | The largest perpendicular distance between the actual trajectory and the ideal trajectory |
| Area under the curve (AUC) | The geometric area between the actual trajectory and the ideal trajectory | |
| x-flip | Number changes in direction along the | |
| y-flip | Number changes in of direction along the |
Figure 2The figure represents the average trajectories between the participants, respectively for liars (in red), truth-tellers (in green) and depressed subjects (in blue), to all questions (EX, DS, VAS, 2DS-d, 2DS-c, DS&EX -d, DS&EX-c, 2 EX-d, 2EX-c).
The table reports means (M) and standard deviations (SD) for each feature collected by the software.
| IT | 620.35 | 491.94 | 408.67 | 332.99 | 559.57 | 399.84 |
| RT | 4018.79 | 1466.98 | 6641.81 | 3204.22 | 4030.60 | 1203.67 |
| MD-time | 2392.37 | 1001.23 | 4199.85 | 1818.62 | 2297.64 | 620.98 |
| MD | 0.44 | 0.31 | 0.51 | 0.30 | 0.57 | 0.24 |
| AUC | 1.01 | 0.90 | 1.09 | 0.76 | 1.25 | 0.67 |
| x-flip | 7.64 | 2.24 | 8.75 | 2.94 | 9.23 | 2.72 |
| y-flip | 7.74 | 2.63 | 8.05 | 2.88 | 9.20 | 2.67 |
| v | 0.00627 | 0.00060 | 0.00582 | 0.00076 | 0.00573 | 0.00059 |
| v | 0.01326 | 0.00014 | 0.01315 | 0.00010 | 0.01326 | 0.00017 |
| a | −0.00001 | 0.00004 | 0.00001 | 0.00002 | 0.00000 | 0.00002 |
| a | −0.00004 | 0.00008 | −0.00006 | 0.00004 | −0.00001 | 0.00009 |
IT, initiation time; RT, reaction time; MD-time, maximum deviation time; MD, maximum deviation; AUC, area under the curve; x-flip, y-flip, average velocity and acceleration on x and y axis = v.
The table reports F-value, degrees of freedom (gdl), p-value and effect-size (Omega-squared, ω2) resulting from the comparison of the three experimental groups for the features that reached the statistical significance.
| DS | |
| EX | |
| 2DS-d | |
| 2DS-c | |
| DS&EX-c | |
| VAS | |
| RT | |
| RT DS | |
| RT EX | |
| RT 2DS-d | |
| RT 2DS-c | |
| RT DS&EX-d | |
| RT DS&EX-c | |
| RT 2EX-d | |
| RT 2EX-c | |
| RT VAS | |
| MD-time | |
| MD-time DS | |
| MD-time EX | |
| MD-time 2DS-d | |
| MD-time 2DS-c | |
| MD-time DS&EX-d | |
| MD-time DS&EX-c | |
| MD-time 2EX-d | |
| MD-time 2EX-c | |
| MD-time VAS | |
| v | |
| v |
Differences between truth-tellers and liars, liars and depressed, truth-tellers and depressed.
| DS | −7.75 | |
| 2DS-d | 5.25 | |
| 2DS-c | −11.85 | |
| DS&EX-c | −2.40 | |
| VAS | −6.80 | |
| v | 0.00053 | |
| DS | 2.45 | |
| 2DS-d | −2.15 | |
| 2DS-c | 5.00 | |
| VAS | 2.85 | |
| RT | −2611.21 | |
| RT DS | −2036.49 | |
| RT 2DS-d | −3507.4 | |
| RT 2DS-c | −2880.60 | |
| RT DS&EX-c | −5171.2 | |
| RT 2EX-d | −2708.4 | |
| RT 2EX-c | −1403.2 | |
| RT VAS | −2012.3 | |
| MD-time | −1902.21 | |
| MD-time DS | −1182.22 | |
| MD-time 2DS-d | −2494.9 | |
| MD-time 2DS-c | −2076.4 | |
| MD-time DS&EX-d | −1741.8 | |
| MD-time DS&EX-c | −4187.4 | |
| MD-TIME 2EX-d | −1968.1 | |
| MD-TIME 2EX-c | −13.45.5 | |
| MD-TIME VAS | −1514.67 | |
| v | 0.00013 | |
| DS | −5.30 | |
| EX | 0.45 | |
| 2DS-d | 3.10 | |
| 2DS-c | −6.85 | |
| DS&EX-c | −2.20 | |
| VAS | −3.95 | |
| RT | −2623.02 | |
| RT DS | −2019.08 | |
| RT EX | −1027.1 | |
| RT 2DS-d | −3183.4 | |
| RT 2DS-c | −2836.75 | |
| RT DS&EX-d | −3480 | |
| RT DS&EX-c | −4350.8 | |
| RT 2EX-c | −1621.3 | |
| RT VAS | −2225.7 | |
| MD-time | −1807.48 | |
| MD-time DS | −1100.85 | |
| MD-time EX | −560 | |
| MD-time 2DS–d | −2240.2 | |
| MD-time 2DS-c | −1937.7 | |
| MD-time DS&EX-d | −2407.9 | |
| MD-time DS&EX-c | −3742.3 | |
| MD-time 2EX-c | −1219.7 | |
| MD-time VAS | −1510.78 | |
The table reports t-value, p-value and effect-size resulting from Tukey test and the value of the difference between the compared groups. Only the results that reached the statistical significance are reported.
The table reports the 6 features resulted from the features selection.
| DS | 0.55 |
| 2DS-c | 0.52 |
| 2DS-d | 0.41 |
| VAS | 0.52 |
| MD-time 2DS-d | 0.36 |
| MD-time VAS | 0.37 |
The second column reports the value of the correlation between the feature and the dependent variable (truth-teller vs. liar vs. depressed).
Accuracies obtained by four different ML classifiers in 10-fold cross-validation and in test set.
| Naïve Bayes | 90 | 96.3 |
| SMO | 83.3 | 92.6 |
| LMT | 81.6 | 96.3 |
| Random Forest | 80 | 92.6 |
Classifiers are Naïve Bayes, Sequential Minimal Optimization (SMO), Logistic Model Tree (LMT) and Random Forest.
Classification accuracies of liars and depressed participants by four different ML classifiers in 10-fold cross-validation and in test set.
| Naïve Bayes | 80 | 94.4 |
| SMO | 82.5 | 88.9 |
| LMT | 80 | 88.9 |
| Random Forest | 87.5 | 94.4 |
Classifiers are Naïve Bayes, Sequential Minimal Optimization (SMO), Logistic Model Tree (LMT) and Random Forest.
Figure 3The figure reports the structure of the decision tree model. Participants declaring <3.5 symtomps on 2DS-c questions are classified as truth-tellers. Participants declaring more than 3.5 symtomps on 2DS-c questions are depressed patients if and only if they take more than 4048 ms to compute the response to the 2DS-d questions, otherwise they are classified as liars.
Classification of participants using two set of alternative predictors.
| Naïve Bayes | 85 | 88.9 |
| SMO | 83.3 | 88.9 |
| LMT | 80 | 92.6 |
| Random Forest | 80 | 88.9 |
| Naïve Bayes | 81.6 | 96.3 |
| SMO | 78.3 | 92.6 |
| LMT | 76.7 | 85.2 |
| Random Forest | 75 | 92.6 |
Alternative models were computed using four different ML classifiers (Naïve Bayes, SMO, LMT, Random Forest). Accuracies in 10-fold cross-validation and in test set are reported.