| Literature DB >> 29861544 |
Bennett Kleinberg1, Yaloe van der Toolen1, Aldert Vrij2, Arnoud Arntz1, Bruno Verschuere1.
Abstract
Recently, verbal credibility assessment has been extended to the detection of deceptive intentions, the use of a model statement, and predictive modeling. The current investigation combines these 3 elements to detect deceptive intentions on a large scale. Participants read a model statement and wrote a truthful or deceptive statement about their planned weekend activities (Experiment 1). With the use of linguistic features for machine learning, more than 80% of the participants were classified correctly. Exploratory analyses suggested that liars included more person and location references than truth-tellers. Experiment 2 examined whether these findings replicated on independent-sample data. The classification accuracies remained well above chance level but dropped to 63%. Experiment 2 corroborated the finding that liars' statements are richer in location and person references than truth-tellers' statements. Together, these findings suggest that liars may over-prepare their statements. Predictive modeling shows promise as an automated veracity assessment approach but needs validation on independent data.Entities:
Keywords: credibility assessment; intentions; machine learning; model statement; verbal deception detection
Year: 2018 PMID: 29861544 PMCID: PMC5969289 DOI: 10.1002/acp.3407
Source DB: PubMed Journal: Appl Cogn Psychol ISSN: 0888-4080
Summary table with confirmatory analyses for Experiment 1 (M, SD, Cohen's d)
| Dependent variable | Past | Future | Main effect veracity | Main effect time | Veracity * Time Interaction | Hyp. | Expected truth–lie difference supported? | ||
|---|---|---|---|---|---|---|---|---|---|
| Truthful | Deceptive | Truthful | Deceptive | ||||||
| Number of words | 261.68 (141.65) | 284.12 (172.92) | 233.72 (139.92) | 210.38 (114.88) | 0.00 ( | 0.18 | 0.08 ( | T > D | No |
| Richness of detail (LIWC) | 19.26 (4.39) | 19.20 (2.88) | 17.83 (4.49) | 18.04 (4.39) | 0.01 ( | 0.16 ( | 0.02 ( | T > D | No |
| % of named entities | 3.35 (2.18) | 4.16 (1.60) | 3.90 (1.89) | 3.85 (2.06) | 0.10 ( | 0.03 ( | 0.11 ( | T > D | No |
| Richness of detail (human coded) | 4.22 (1.64) | 4.97 (1.24) | 4.43 (1.34) | 4.14 (1.45) | 0.08 ( | 0.11 ( | 0.18 ( | T > D | No |
| How‐utterances (human coded) | 5.16 (1.24) | 4.63 (0.80) | 4.60 (1.04) | 4.00 (1.13) | 0.26 | 0.28 | 0.02 ( | T > D | Yes |
| Why‐utterances (human coded) | 3.23 (1.26) | 3.20 (1.40) | 3.24 (1.06) | 3.25 (1.44) | 0.00 ( | 0.01 ( | 0.01 ( | D > T | No |
p < .007.
Accuracies of the supervised machine learning task (linear support vector machine) for two different LIWC feature sets
| Feature set | Data | Accuracy [95% CI] | Sens. | Spec. | AUC (95% CI) |
|---|---|---|---|---|---|
| Complete LIWC | Past weekend plans | 69.23 [48.21, 85.67] | 71.43 | 66.67 | 0.70 [0.48, 0.91] |
| Forthcoming weekend plans | 80.65 [62.53, 92.55] | 62.50 | 100.00 | 0.75 [0.56, 0.94] | |
| Psychological processes | Past weekend plans | 61.54 [40.57, 79.99] | 78.87 | 41.67 | 0.77 [0.58, 0.96] |
| Forthcoming weekend plans | 74.19 [55.39, 88.14] | 62.50 | 86.67 | 0.78 [0.62, 0.94] |
Note. LIWC = Linguistic Inquiry and Word Count; Sens. = sensitivity; Spec. = specificity.
Significantly better than the chance level.
Means (SDs, Cohen's d) for the dependent variables used in the exploratory analyses per time and veracity
| Dependent variable | Main effect veracity | Past weekend plans | Future weekend plans | ||||
|---|---|---|---|---|---|---|---|
| Truthful | Deceptive | Main effect veracity | Truthful | Deceptive | Main effect veracity | ||
| Richness in detail: percept | −0.12 | 1.93 (1.37) | 2.10 (1.26) | −0.06 | 1.52 (1.53) | 1.99 (1.30) | −0.17 |
| Richness in detail: time | 0.23 | 8.96 (3.05) | 8.09 (2.01) | 0.12 | 8.75 (3.41) | 7.16 (2.36) | 0.27 |
| Richness in detail: space | −0.17 | 8.38 (2.67) | 9.02 (2.56) | −0.17 | 7.56 (2.96) | 8.90 (3.43) | −0.21 |
| Function words (function) | −0.16 | 53.49 (4.08) | 55.35 (3.12) | −0.25 | 55.89 (3.99) | 56.46 (3.90) | −0.07 |
| Personal pronouns (ppron) | −0.09 | 9.67 (2.70) | 10.79 (2.30) | −0.22 | 10.69 (2.56) | 10.53 (2.53) | 0.03 |
| First person singular (i) | 0.24 | 6.53 (2.70) | 5.15 (2.69) | 0.26 | 6.80 (3.54) | 5.40 (2.55) | 0.23 |
| Numbers (number) | 0.12 | 1.86 (1.42) | 1.57 (1.03) | 0.12 | 1.70 (1.47) | 1.41 (1.04) | 0.11 |
| Persons | −0.32 | 0.29 (0.49) | 0.76 (0.70) | −0.39 | 0.34 (0.63) | 0.73 (0.77) | −0.27 |
| Geopolitical entities | −0.25 | 0.17 (0.45) | 0.48 (0.59) | −0.30 | 0.27 (0.51) | 0.51 (0.66) | −0.21 |
| Dates | 0.13 | 1.11 (0.77) | 1.06 (0.62) | 0.03 | 1.56 (0.98) | 1.17 (0.89) | 0.21 |
| Time | 0.12 | 0.54 (0.68) | 0.56 (0.52) | −0.02 | 0.53 (0.59) | 0.29 (0.42) | 0.24 |
| Ordinal | 0.17 | 0.24 (0.39) | 0.09 (0.20) | 0.25 | 0.13 (0.28) | 0.09 (0.22) | 0.08 |
Note. Negative effect sizes imply higher values in deceptive than in truthful statements.
p < .05.
p < .01.
Summary table with the confirmatory analyses for Experiment 2 (M, SD, Cohen's d)
| Dependent variable | Without model statement | With model statement | Main effect veracity | Main effect model statement | Veracity * Model Statement | Hyp. | Expected truth–lie difference supported? | ||
|---|---|---|---|---|---|---|---|---|---|
| Truthful | Deceptive | Truthful | Deceptive | ||||||
| Person references (NER) | 16.58 (51.59) | 23.59 (51.22) | 23.68 (48.18) | 40.23 (54.53) | −0.11 | 0.12 | 0.04 ( | D > T | Yes |
| Location references (NER) | 18.91 (49.71) | 29.67 (57.63) | 24.55 (57.81) | 42.82 (68.69) | −0.12 | 0.08 ( | 0.03 ( | D > T | Yes |
| Temporal information (LIWC) | 9.10 (3.79) | 9.06 (3.92) | 7.91 (2.69) | 8.01 (3.07) | 0.01 ( | 0.16 | 0.01 ( | T > D | No |
| Spatial information (LIWC) | 7.56 (3.58) | 7.81 (2.93) | 7.86 (3.11) | 7.88 (2.96) | 0.02 ( | 0.03 ( | 0.02 ( | D > T | No |
| Date references (NER) | 170.54 (124.20) | 175.76 (132.47) | 133.89 (93.32) | 139.11 (97.44) | 0.02 ( | 0.16 | 0.00 ( | T > D | No |
| Time references (NER) | 52.79 (87.51) | 36.91 (61.22) | 45.60 (55.57) | 48.95 (55.99) | 0.05 ( | 0.02 ( | 0.07 ( | T > D | No |
| Number of words | 121.83 (57.37) | 118.55 (48.54) | 202.88 (107.36) | 188.43 (93.47) | 0.06 ( | 0.48 | 0.04 ( | — | — |
Note. Negative effect sizes imply higher values in deceptive than in truthful statements. LIWC = Linguistic Inquiry and Word Count; NER, named entity recognition.
p < .05.
p < .01.
p < .001.