| Literature DB >> 27721800 |
Andrej A Kibrik1, Mariya V Khudyakova2, Grigory B Dobrov3, Anastasia Linnik4, Dmitrij A Zalmanov5.
Abstract
We report a study of referential choice in discourse production, understood as the choice between various types of referential devices, such as pronouns and full noun phrases. Our goal is to predict referential choice, and to explore to what extent such prediction is possible. Our approach to referential choice includes a cognitively informed theoretical component, corpus analysis, machine learning methods and experimentation with human participants. Machine learning algorithms make use of 25 factors, including referent's properties (such as animacy and protagonism), the distance between a referential expression and its antecedent, the antecedent's syntactic role, and so on. Having found the predictions of our algorithm to coincide with the original almost 90% of the time, we hypothesized that fully accurate prediction is not possible because, in many situations, more than one referential option is available. This hypothesis was supported by an experimental study, in which participants answered questions about either the original text in the corpus, or about a text modified in accordance with the algorithm's prediction. Proportions of correct answers to these questions, as well as participants' rating of the questions' difficulty, suggested that divergences between the algorithm's prediction and the original referential device in the corpus occur overwhelmingly in situations where the referential choice is not categorical.Entities:
Keywords: cross-methodological approach; discourse production; machine learning; non-categoricity; referential choice
Year: 2016 PMID: 27721800 PMCID: PMC5033969 DOI: 10.3389/fpsyg.2016.01429
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
The WSJ MoRA 2015 corpus: a quantitative characterization.
| Feature | Comment | Number in corpus |
|---|---|---|
| Texts | 64 | |
| Paragraphs | 511 | |
| Sentences | 976 | |
| Elementary discourse units (EDU) | EDU segmentation of texts is automatically extracted from the RST Discourse Treebank | 2928 |
| Words | 23952 | |
Types and numbers of markables (referential expressions).
| Type of markable | Comment | Number in corpus | |
|---|---|---|---|
| 1. | Sum of #2 to #7 | ||
| 2. | Personal pronouns | 495 | |
| 3. | Possessive pronouns | 264 | |
| 4. | Zeroes | 375 | |
| 5. | Demonstratives | 67 | |
| 6. | Relative pronouns | 135 | |
| 7. | Other | 37 | |
| 8. | Sum of #9 and #18 minus #27∗) | ||
| 9. | Sum of #10 to #15 | ||
| 10. | The-descriptions | 1241 | |
| 11. | A-descriptions | 420 | |
| 12. | Bare descriptions | 1200 | |
| 13. | Demonstrative descriptions | E.g. | 88 |
| 14. | Possessive descriptions | E.g. | 490 |
| 15. | Other | 78 | |
| | |||
| 16. | Attributive descriptions | E.g. | 1458 |
| 17. | Numeral descriptions | E.g. | 136 |
| 18. | Sum of #19 to #25∗) | ||
| 19. | First names | 21 | |
| 20. | Last names | 229 | |
| 21. | First plus last names | 193 | |
| 22. | Initials plus last names | E.g. | 1 |
| 23. | Non-persons | Names of countries, organizations, units, etc. | 915 |
| 24. | Acronyms | E.g. | 277 |
| 25. | Other | 45 | |
| | |||
| 26. | Titled proper names | E.g. | 162 |
| E.g. | |||
Anaphor types.
| Anaphor type | Number used for analysis |
|---|---|
| Third person pronouns (personal or possessive) | 585 (26.0%) |
| Descriptions | 856 (38.1%) |
| Proper names | 807 (35.9%) |
| Total | 2248 (100%) |
Candidate factors of referential choice.
• Animacy: animate, inanimate, collective ( • Gender (for animate referents only): masculine, feminine, mixed ( • Person: 1, 2, 3 • Number: singular, plural • Protagonism: |
• Ordinal number of referent mention in the referential chain: • Type of phrase: noun phrase, prepositional phrase • Grammatical role: subject, direct object, indirect object, oblique (with preposition), attribute, |
• Type of phrase (values same as in the section “Anaphor’s factors”) • Grammatical role (values same as in the section “Anaphor’s factors”) • Referential form: ∘ pronoun: personal, possessive, demonstrative, relative, zero ∘ description: a-description, the-description, bare description, demonstrative description, possessive description ∘ attributive ∘ numeral ∘ proper name: first, last, first and last, initials and last, non-person, acronym ∘ Antecedent length, in words: |
• Distance in words: • Distance in all markables: • Number of markables in chain from the anaphor back to the nearest full NP antecedent: • Linear distance in EDUs: • Rhetorical distance (RhD) in elementary discourse units: • Distance in sentences: • Distance in paragraphs: |
Prediction of the basic referential choice.
| Algorithm | Accuracy | Full NP | Pronoun | ||||
|---|---|---|---|---|---|---|---|
| Precision | Recall | F1 | Precision | Recall | F1 | ||
| Baseline | 74.0% | 74.0% | 1 | 85.0% | 0 | 0 | 0 |
| C4.5 algorithm | 88.9% | 91.7% | 92.0% | 91.9% | 77.3% | 76.7% | 77.0% |
| Logistic regression | 88.6% | 91.5% | 92.6% | 92.1% | 78.5% | 76.0% | 77.2% |
| Bagging | 89.4% | 91.9% | 93.6% | 92.7% | 81.0% | 76.8% | 78.9% |
| Boosting | 89.8% | 92.2% | 93.6% | 92.8% | 80.9% | 77.4% | 79.1% |
Confusion matrix for the boosting algorithm, basic referential choice.
| Predicted full NP | Predicted third person pronoun | Total | |
|---|---|---|---|
| Original full NP | 1556 (93.6%) | 107 (6.4%) | 1663 (100%) |
| Original pronoun | 132 (22.6%) | 453 (77.4%) | 585 (100%) |
The significance of factors in modeling the basic referential choice (boosting with 50 iterations).
| Factors | Accuracy(%) |
|---|---|
| All factors | 89.8 |
| — without animacy | 89.4 |
| — without protagonism | 89.7 |
| — without the anaphor’s grammatical role | 88.3 |
| — without the antecedent’s grammatical role | 89.2 |
| — without grammatical role | 87.7 |
| — without the antecedent’s referential form | 89.4 |
| All non-distance factors only | 75.5 |
| — plus distance in all markables | 82.5 |
| — plus distances in words and paragraphs | 87.2 |
| — plus RhD, distance in words, and distance in sentences | 88.7 |
| All distance factors only | 83.2 |
Prediction of the three-way referential choice.
| Algorithm | Accuracy (%) |
|---|---|
| Baseline | 38.1 |
| C4.5 Decision tree algorithm | 72.3 |
| Logistic regression | 73.5 |
| Bagging | 73.1 |
| Boosting | 75.7 |
Readability indices for the texts used in the experimental evaluation of referential choice.
| Text | Flesch Reading Ease score ( | Gunning Fog ( | Flesch-Kincaid Grade Level ( | The Coleman-Liau Index ( | The SMOG Index ( | Automated Readability Index ( |
|---|---|---|---|---|---|---|
| 30-49: Difficult | Grade level | |||||
| 50-59: Fairly difficult | (1 to 12 correspond to school grades, 13 and higher to college levels) | |||||
| 1 | 36.0 | 17.5 | 14.1 | 14.0 | 13.7 | 15.7 |
| 2 | 58.1 | 12.3 | 9.7 | 10.0 | 9.4 | 9.5 |
| 3 | 43.5 | 17.2 | 14.1 | 9.0 | 12.9 | 14.1 |
| 4 | 38.0 | 18.1 | 15.2 | 11.0 | 13.7 | 15.8 |
| 5 | 36.9 | 15.6 | 14.0 | 11.0 | 12.7 | 13.7 |
| 6 | 46.7 | 13.7 | 11.7 | 10.0 | 12.4 | 11.1 |
| Average | 43.2 | 15.7 | 13.1 | 10.8 | 12.5 | 13.3 |
Numbers of correct and incorrect responses given by participants.
| Number of incorrect responses (out of 18) | Number of participants |
|---|---|
| 0 | 6 |
| 1 | 6 |
| 2 | 9 |
| 3 | 3 |
Numbers of correct responses to each question in the experiment and difficulty ratings.
| Question group | Question number | Correct responses | Ratings | |||
|---|---|---|---|---|---|---|
| N out of 12 | % of all responses | Mean | Median | Mode | ||
| Experimental questions, original referential expression | 1 | 11 | 91.67 | 2.83 | 3 | 3 |
| 2 | 10 | 83.33 | 2.67 | 2.5 | 2 | |
| 3 | 11 | 91.67 | 2.83 | 3 | 4 | |
| 4 | 10 | 83.33 | 2.75 | 3 | 3 | |
| 5 | 11 | 91.67 | 2.75 | 3 | 3 | |
| 6 | 11 | 91.67 | 2.50 | 2.5 | 4 | |
| Experimental questions, modified referential expression | 1 | 10 | 83.33 | 2.50 | 2.5 | 3 |
| 2 | 11 | 91.67 | 2.58 | 3 | 3 | |
| 3 | 10 | 83.33 | 2.75 | 3 | 2 | |
| 4 | 11 | 91.67 | 2.92 | 3 | 3 | |
| 5 | 11 | 91.67 | 2.83 | 3 | 4 | |
| 6 | 11 | 91.67 | 2.58 | 3 | 3 | |
| Control questions | 1 yes/no | 22 | 91.67 | 2.63 | 2.5 | 2 |
| 1 WH | 23 | 95.83 | 2.67 | 2.5 | 2 | |
| 2 yes/no | 22 | 91.67 | 2.83 | 3 | 3 | |
| 2 WH | 23 | 95.83 | 2.92 | 3 | 3 | |
| 3 yes/no | 20 | 83.33 | 2.63 | 3 | 3 | |
| 3 WH | 21 | 87.50 | 2.63 | 3 | 3 | |
| 4 yes/no | 21 | 87.50 | 2.67 | 2.5 | 2 | |
| 4 WH | 23 | 95.83 | 2.58 | 3 | 3 | |
| 5 yes/no | 18 | 75.00 | 2.67 | 3 | 3 | |
| 5 WH | 22 | 91.67 | 2.67 | 2.5 | 2 | |
| 6 yes/no | 21 | 87.50 | 2.67 | 3 | 3 | |
| 6 WH | 22 | 91.67 | 2.67 | 3 | 3 | |