| Literature DB >> 30001373 |
Abstract
Modern authorship attribution methods are often comprised of powerful yet opaque machine learning algorithms. While much of this work lends itself to concrete outcomes in the form of probability scores, advanced approaches typically preclude deeper insights in the form of psychological interpretation. Additionally, few attribution methods exist for single-candidate authorship problems, most of which require large amounts of supplemental data to perform and none of which rely upon explicitly psychological measures. The current study introduces Mental Profile Mapping, a new authorship attribution technique for single-candidate authorship questions that is founded on previous scientific research pertaining to the nature of language and psychology. In the current study, baseline expectations for results and performance are set using an advanced technique known as "unmasking" on the test case of Aphra Behn, a 17th century English playwright. Following this, Mental Profile Mapping is introduced and tested for its psychometric properties, tested using a "bogus insertion" method, and then applied to canonical Aphra Behn plays. Results from both attribution methods suggest that 2 of 5 questioned plays are likely to have been authored by Behn, with the remaining 3 plays exhibiting a poor fit for Behn's psychological fingerprint. Mental Profile Mapping results are then decomposed into deeper psychological interpretation, a quality unique to this new method.Entities:
Mesh:
Year: 2018 PMID: 30001373 PMCID: PMC6042775 DOI: 10.1371/journal.pone.0200588
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Aphra Behn plays included in the current analyses.
| The Forc'd Marriage | The Rover, Part I* | The Roundheads |
|---|---|---|
| The Amorous Prince | Sir Patient Fancy | The False Count |
| The Dutch Lover | The Feigned Courtesans | The Young King |
| Abdelazer* | The Rover, Part II* | The Emperor of the Moon |
| The Town Fop | The City Heiress | The Lucky Chance |
| The Widow Ranter |
Note: Adaptions of other people’s work are denoted with an asterisk (*).
Results from the unmasking analysis.
| Questioned Play Title | Comparison Author | Authorship Match | Result |
|---|---|---|---|
| The Counterfeit Bridegroom | Behn | Different | 82.08% |
| The Counterfeit Bridegroom | Marlowe | Different | 99.83% |
| The Counterfeit Bridegroom | Fletcher | Different | 99.40% |
| The Counterfeit Bridegroom | Rowley | Different | 98.99% |
| The Counterfeit Bridegroom | Theobald | Different | 99.97% |
| The Counterfeit Bridegroom | Dekker | Different | 98.61% |
| The Counterfeit Bridegroom | Shakespeare | Different | 96.05% |
| The Debauchee | Behn | Different | 94.41% |
| The Debauchee | Marlowe | Different | 99.98% |
| The Debauchee | Fletcher | Different | 95.07% |
| The Debauchee | Rowley | Different | 99.71% |
| The Debauchee | Theobald | Different | 100.00% |
| The Debauchee | Dekker | Different | 99.52% |
| The Debauchee | Shakespeare | Different | 98.32% |
| The Revenge | Behn | Match | 99.98% |
| The Revenge | Marlowe | Different | 99.72% |
| The Revenge | Fletcher | Different | 98.10% |
| The Revenge | Rowley | Different | 99.48% |
| The Revenge | Theobald | Different | 99.89% |
| The Revenge | Dekker | Different | 94.90% |
| The Revenge | Shakespeare | Different | 96.88% |
| The Woman Turned Bully | Behn | Different | 88.53% |
| The Woman Turned Bully | Marlowe | Different | 99.93% |
| The Woman Turned Bully | Fletcher | Different | 97.80% |
| The Woman Turned Bully | Rowley | Different | 99.87% |
| The Woman Turned Bully | Theobald | Different | 99.94% |
| The Woman Turned Bully | Dekker | Different | 99.94% |
| The Woman Turned Bully | Shakespeare | Different | 87.51% |
| The Younger Brother | Behn | Match | 55.10% |
| The Younger Brother | Marlowe | Different | 99.93% |
| The Younger Brother | Fletcher | Different | 99.91% |
| The Younger Brother | Rowley | Different | 99.58% |
| The Younger Brother | Theobald | Different | 99.69% |
| The Younger Brother | Dekker | Different | 99.55% |
| The Younger Brother | Shakespeare | Different | 99.11% |
Note: Plays highlighted in green indicate a play’s match with a playwright’s psychological signature. Plays highlighted in light red indicate a play’s non-match with supplemental playwrights. Plays highlighted in bright red indicate a play’s non-match with Behn’s psychological signature. Unmasking data was created by averaging results across 20 randomized iterations.
Psychological process compositions.
| Psychological Process | LIWC2015 Variables included in Distance Calculation |
|---|---|
| 1: Style | Analytic, Clout, Authentic, Tone |
| 2: Complexity | Analytic, Sixltr |
| 3: Function Words | i, we, you, shehe, they, ipron, article, prep, auxverb, adverb, conj, negate, interrog |
| 4: Emotional | affect, posemo, negemo, anx, anger, sad |
| 5: Social | social, family, friend, female, male |
| 6: Cognitive | cogproc, insight, cause, discrep, tentat, certain, differ |
| 7: Perceptual | percept, see, hear, feel |
| 8: Biological | bio, body, health, sexual, ingest |
| 9: Motivational | drives, affiliation, achieve, power, reward, risk |
| 10: Temporal | focuspast, focuspresent, focusfuture |
| 11: Relational | relative, motion, space, time |
| 12: Personal | work, leisure, home, money, relig, death |
| 13: Utterances | informal, swear, assent, nonflu |
Note: Each process consists of several subdimensions that are factored together when calculating distance metrics.
Internal consistency of distance measures for each author.
| Author | Cronbach’s Alpha of Distance Measures |
|---|---|
| Behn | .66 |
| Fletcher | .54 |
| Marlowe | .64 |
| Shakespeare | .52 |
| Theobald | .70 |
| Dekker | .66 |
Note: These results are congruent with other work on the internal consistency of psychological measures of language [19] and demonstrate that language-based measures of psychological processes do, in fact, vary in unison from the center.
Summary statistics and inter-item correlations for all distance measures calculated for the verified plays of Aphra Behn (i.e., excluding questioned plays).
| Distance Measures | Mean (SD) | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1: Style | 46.24 (26.87) | – | |||||||||||
| 2: Complexity | 48.05 (30.81) | -0.152 | – | ||||||||||
| 3: Function Words | 45.34 (14.32) | 0.410 | 0.281 | – | |||||||||
| 4: Emotional | 49.08 (28.42) | 0.337 | -0.234 | -0.143 | – | ||||||||
| 5: Social | 50.29 (28.67) | 0.210 | -0.200 | -0.324 | 0.230 | – | |||||||
| 6: Cognitive | 47.14 (27.31) | 0.019 | 0.179 | 0.439 | -0.388 | -0.317 | – | ||||||
| 7: Perceptual | 50.36 (28.47) | 0.163 | 0.301 | 0.037 | 0.188 | 0.626 | -0.343 | – | |||||
| 8: Biological | 54.16 (29.81) | 0.522 | 0.006 | 0.057 | 0.467 | 0.503 | -0.321 | 0.567 | – | ||||
| 9: Motivational | 53.49 (31.59) | 0.497 | -0.474 | -0.141 | 0.618 | 0.363 | -0.095 | 0.193 | 0.391 | – | |||
| 10: Temporal | 47.03 (30.58) | 0.297 | 0.380 | 0.232 | -0.176 | -0.196 | 0.312 | -0.242 | -0.116 | -0.178 | – | ||
| 11: Relational | 47.01 (26.66) | 0.641 | -0.118 | 0.286 | 0.277 | 0.170 | -0.177 | 0.174 | 0.448 | 0.052 | -0.071 | – | |
| 12: Personal | 49.87 (29.72) | 0.542 | 0.072 | 0.198 | 0.545 | 0.154 | -0.269 | 0.194 | 0.714 | 0.276 | -0.035 | 0.515 | – |
| 13: Utterances | 50.41 (29.39) | -0.221 | 0.555 | 0.122 | 0.030 | -0.224 | 0.118 | 0.088 | 0.209 | -0.271 | -0.016 | -0.219 | 0.350 |
Note: The resulting Cronbach’s alpha for all distance measures (α = .66) suggests sufficient intercorrelation to constitute collapsing the distance measures into a single, meaningful psychological metric.
Fig 1Visualized mental profile maps of 6 playwrights: a) Aphra Behn, b) Thomas Dekker, c) John Fletcher, d) Christopher Marlowe, e) William Shakespeare, and f) Lewis Theobald. Note: The center point (0,0) is the approximate “psychological center” for each playwright when collapsing across all 13 psychological process categories. Note that the projection of distance measures down to 2 dimensions does result in some distortion, and this graph should be interpreted as an approximation of the “true” location of each play. In other words, some plays may actually be somewhat closer to (or farther from) the center of each map than what the image may suggest.
Results of the MPM analysis.
| Author | Title | Mental Profile |
|---|---|---|
| Behn | The Lucky Chance | 76.59 |
| Behn | Sir Patient Fancy | 69.41 |
| Behn | The Young King | 68.98 |
| Behn | The Feigned Courtesans | 61.26 |
| Behn | The False Count | 60.56 |
| Behn | The Town Fop | 60.56 |
| Behn | The Dutch Lover | 58.88 |
| Behn | The Rover, Part I | 53.59 |
| Behn | The City Heiress | 50.00 |
| Behn | The Roundheads | 46.36 |
| Behn | The Forc'd Marriage | 45.76 |
| Behn | The Rover, Part II | 42.13 |
| Behn | The Amorous Prince | 41.02 |
| Behn | Abdelazer | 33.98 |
| Behn | The Emperor of the Moon | 24.43 |
| Shakespeare | Julius Caesar | 24.20 |
| Theobald | The Fatal Secret | 21.11 |
| Dekker | The Whore of Babylon | 18.09 |
| Behn | The Widow Ranter | 17.64 |
Note: MPM scores for plays marked as “Behn” authorship are averaged across each MPM analysis (correlation with Behn-only analysis: r = .95). Works highlighted in yellow are those that were artificially inserted as bogus Behn plays.
Fig 2Visualized mental profile maps of Aphra Behn when including bogus plays by other playwrights.
Note: The blue diamond denotes the psychological center of each map. Bogus plays are circled in red and stand out considerably from the rest of the map in each case. Bogus plays include Dekker’s The Whore of Babylon (left), Shakespeare’s Julius Caesar (middle), and Theobald’s The Fatal Secret (right).
Results of the MPM analysis.
MPM scores for plays marked with “Behn” authorship are the result of the MPM analysis that included only verified Behn plays. Works highlighted in yellow are those of questioned authorship. Higher Grand MPM scores are indicative of plays with a general low distance from center (i.e., a better fit with Behn’s mental profile map).
| Author | Title | Grand MPM | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Behn | The Lucky Chance | 72.02 | 69.42 | 96.94 | 68.38 | 14.84 | 50.54 | 72.02 | 75.64 | 75.21 | 3.04 | 94.39 | 81.37 | 62.60 | 87.84 |
| Behn | The Young King | 70.56 | 83.71 | 44.34 | 73.98 | 70.56 | 47.75 | 19.60 | 90.49 | 81.69 | 69.88 | 33.71 | 85.94 | 75.14 | 12.98 |
| Questioned | The Younger Brother | 67.60 | 67.60 | 68.81 | 29.27 | 54.76 | 27.29 | 88.03 | 33.52 | 53.14 | 71.21 | 77.16 | 10.11 | 88.00 | 79.82 |
| Behn | Sir Patient Fancy | 66.26 | 94.83 | 29.76 | 29.22 | 66.26 | 73.22 | 29.03 | 39.74 | 77.54 | 83.89 | 92.20 | 70.82 | 47.62 | 5.38 |
| Behn | The Feigned Courtesans | 62.54 | 41.50 | 60.35 | 36.47 | 93.81 | 62.54 | 21.89 | 48.67 | 65.31 | 85.75 | 84.21 | 30.48 | 84.60 | 66.79 |
| Behn | The False Count | 58.94 | 36.57 | 93.89 | 42.78 | 68.39 | 17.45 | 69.25 | 67.80 | 57.58 | 76.97 | 30.20 | 28.58 | 58.94 | 89.54 |
| Behn | The Town Fop | 56.17 | 56.17 | 72.93 | 65.45 | 23.58 | 50.61 | 75.45 | 39.49 | 76.23 | 54.98 | 45.95 | 28.91 | 72.18 | 88.46 |
| Behn | The Rover, Part I | 54.98 | 91.05 | 14.78 | 54.98 | 68.63 | 32.55 | 73.98 | 13.39 | 35.02 | 85.04 | 41.39 | 90.57 | 80.09 | 40.48 |
| Behn | The City Heiress | 50.75 | 50.75 | 49.00 | 36.70 | 68.04 | 82.15 | 18.78 | 96.26 | 68.83 | 75.59 | 38.96 | 13.39 | 45.18 | 77.44 |
| Questioned | The Revenge | 49.78 | 92.11 | 90.84 | 28.70 | 4.30 | 7.51 | 31.98 | 68.13 | 58.54 | 59.78 | 70.88 | 45.73 | 32.47 | 49.78 |
| Behn | The Forc'd Marriage | 45.99 | 45.99 | 13.37 | 25.63 | 31.94 | 88.31 | 11.08 | 68.95 | 84.46 | 65.84 | 9.33 | 56.16 | 80.95 | 29.76 |
| Behn | The Roundheads | 44.51 | 44.51 | 16.89 | 55.40 | 49.21 | 53.77 | 85.26 | 35.96 | 27.67 | 78.52 | 59.36 | 25.46 | 7.69 | 3.03 |
| Behn | The Amorous Prince | 42.31 | 29.00 | 19.26 | 33.14 | 42.31 | 88.81 | 70.89 | 73.44 | 70.77 | 79.00 | 13.71 | 44.97 | 17.98 | 38.26 |
| Behn | The Rover, Part II | 40.83 | 22.85 | 64.91 | 40.83 | 40.42 | 3.79 | 27.25 | 31.95 | 54.26 | 20.72 | 45.90 | 52.28 | 41.12 | 59.97 |
| Behn | The Dutch Lover | 40.31 | 32.16 | 31.30 | 39.82 | 98.54 | 60.88 | 29.40 | 40.31 | 81.61 | 29.74 | 12.27 | 69.63 | 87.87 | 78.13 |
| Behn | Abdelazer | 35.53 | 8.56 | 92.99 | 36.36 | 20.38 | 74.22 | 35.53 | 71.12 | 6.14 | 5.37 | 35.97 | 45.08 | 9.91 | 31.47 |
| Questioned | The Counterfeit Bridegroom | 34.60 | 17.31 | 86.96 | 30.93 | 34.60 | 2.51 | 43.92 | 42.09 | 87.44 | 35.15 | 10.21 | 5.04 | 85.49 | 29.15 |
| Questioned | The Woman Turned Bully | 33.58 | 3.61 | 69.86 | 31.66 | 18.59 | 59.72 | 20.98 | 33.58 | 55.48 | 44.41 | 5.25 | 6.53 | 52.57 | 96.00 |
| Questioned | The Debauchee | 32.34 | 37.53 | 32.34 | 29.25 | 2.46 | 29.24 | 45.32 | 8.71 | 26.37 | 39.47 | 20.70 | 49.54 | 39.25 | 66.22 |
| Behn | The Widow Ranter | 20.03 | 20.03 | 61.74 | 46.74 | 2.12 | 4.40 | 86.16 | 1.48 | 1.34 | 3.12 | 99.67 | 4.71 | 20.69 | 48.53 |
| Behn | The Emperor of the Moon | 15.23 | 12.70 | 6.35 | 39.41 | 26.21 | 13.65 | 28.59 | 11.05 | 2.89 | 38.38 | 15.23 | 23.70 | 5.30 | 48.47 |
Note: MPM Distance Measures Key: 1: Style, 2: Complexity, 3: Function Words, 4: Emotional, 5: Social
6: Cognitive, 7: Perceptual, 8: Biological, 9: Motivational
10: Temporal, 11: Relational, 12: Personal, 13: Utterances
Fig 3Visualization of Behn’s mental profile map when including The Revenge (left), a play that shows a strong MPM score, and The Debauchee (right), a play that shows a weak MPM score. Note: In both maps, the blue diamond represents the composite center of the map. As with Figs 1 and 2, the locations of each play should be considered slightly distorted approximations.
Decomposed categories of psychological processes that showed particularly great distance from center for the 3 questioned plays with poor support (i.e., MPM scores < = 20) for Behn’s authorship.
| Play Title | Highly Discrepant Psychological Processes |
|---|---|
| The Counterfeit Bridegroom | Style, Social, Temporal, Relational |
| The Woman Turned Bully | Style, Emotional, Motivational, Temporal |
| The Debauchee | Affect, Perceptual |