| Literature DB >> 27630592 |
Roman Kutlak1, Kees van Deemter1, Chris Mellish1.
Abstract
This article presents a computational model of the production of referring expressions under uncertainty over the hearer's knowledge. Although situations where the hearer's knowledge is uncertain have seldom been addressed in the computational literature, they are common in ordinary communication, for example when a writer addresses an unknown audience, or when a speaker addresses a stranger. We propose a computational model composed of three complimentary heuristics based on, respectively, an estimation of the recipient's knowledge, an estimation of the extent to which a property is unexpected, and the question of what is the optimum number of properties in a given situation. The model was tested in an experiment with human readers, in which it was compared against the Incremental Algorithm and human-produced descriptions. The results suggest that the new model outperforms the Incremental Algorithm in terms of the proportion of correctly identified entities and in terms of the perceived quality of the generated descriptions.Entities:
Keywords: audience design; common ground; computational model; corpus; generation of referring expressions
Year: 2016 PMID: 27630592 PMCID: PMC5005345 DOI: 10.3389/fpsyg.2016.01275
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
A sample of referent names and corresponding descriptions in the corpus.
| Albert Einstein | This person was the author of the theory of relativity |
| This person was the physicist who developed the Theory of Relativity that revolutionized how we understand space, time, and gravity | |
| This person was the German-American mathematician | |
| Thomas Edison | This person invented the light bulb |
| This person was most famous for his inventions of the light blub and the phonograph | |
| This person was the inventor of the light bulb, phonograph, and movie projector | |
| Elvis Presley | This person was the King of Rock “n” Roll |
| This person was the King of Rock and Roll, born in Tupelo, Mississippi, who had Graceland built | |
| This person was one of the most popular singers ever, with hits including Blue Suede Shoes and Jailhouse Rock |
Figure 1Names of famous people used in the pilot test of the potential metrics for the Knowledge Heuristic.
List of properties true of Albert Einstein.
| Albert Einstein was a physicist | 80.95 | 827000 | 4.00 |
| Albert Einstein invented the theory of relativity | 80.43 | 69600 | 3.00 |
| Albert Einstein was German | 67.39 | 1060000 | 5.00 |
| Albert Einstein emigrated to the United States | 47.06 | 20200 | 2.00 |
| Albert Einstein was a professor at the Karl-Ferdinand University in Prague | 30.30 | 652 | 1.00 |
The percentage of affirmative answers show what percentage of participants believed the statement to be true. Rank shows how the corresponding properties ranked according to the . Spearman correlation between percentage and frequency rs(48) = 0.67;p < 0.001.
Pilot results: Spearman correlation between the metrics and knowledge of hearers.
| 0.667 | −0.063 | 0.672 | 0.628 | |
| 0.000 | 0.663 | 0.000 | 0.000 |
Figure 2Names of famous people used in the test of the Knowledge Heuristic.
List of properties of Ernest Hemingway.
| Ernest Hemingway was a writer | True | 100.0 | 1 |
| Ernest Hemingway was American | True | 100.0 | 2 |
| Ernest Hemingway received the Nobel Prize in Literature | True | 63.6 | 5 |
| Ernest Hemingway is the author of For whom the bell tolls | True | 54.5 | 4 |
| Ernest Hemingway committed a suicide | True | 50.0 | 3 |
| Ernest Hemingway was British | False | 27.3 | – |
| Ernest Hemingway was born in Oak Park | True | 25.0 | 6 |
| Ernest Hemingway received the Italian Silver Medal of Bravery | True | 20.0 | 7 |
| Ernest Hemingway is the author of A tale of two cities | False | 13.3 | – |
| Ernest Hemingway invented dynamite | False | 0.0 | – |
| Ernest Hemingway died in a plane crash | False | 0.0 | – |
| Ernest Hemingway was born in Paris | False | 0.0 | – |
Condition shows whether a property was true or false (a filler) and the percentage of affirmative answers shows what percentage of participants believed the statement to be true. Rank shows how the corresponding properties ranked according to the . Spearman correlation between percentage and rank rs(68) = 0.73;p < 0.001.
Unexpectedness of some properties, by Equation (5) and by DP, calculated across DBpedia.
| 〈 | 0 | 0.0 |
| 〈 | 11 | 0.6233 |
| 〈 | 89 | 0.9962 |
| 〈 | 430 | 0.9998 |
Figure 3Pseudocode describing the termination heuristic. The heuristic returns true (and terminates the algorithm) when adding a property to a description does not result in a large decrease in the number of matching documents.
Figure 4Pseudocode for algorithms .
Figure 5Presentation of descriptions in the evaluation. Participants provided judgment of each description by moving the sliders. The box at the bottom of the page was used for providing the name.
Figure 6Options shown to participants after guessing the name of the described person.
Counts of selected answers for individual algorithms in the final evaluation.
| Correct identification | 68 | 58 | 89 | 100 | 180 | 495 |
| Tip-of-the-Tongue (ToT) | 21 | 17 | 24 | 24 | 28 | 114 |
| Unknown target | 44 | 64 | 61 | 61 | 51 | 281 |
| Unknown properties | 55 | 136 | 106 | 94 | 33 | 424 |
| Underspecified | 104 | 21 | 7 | 10 | 2 | 144 |
| At odds with my information | 2 | 3 | 8 | 6 | 3 | 22 |
| Other | 6 | 1 | 5 | 4 | 2 | 18 |
| Total | 300 | 300 | 300 | 299 | 299 | 1498 |
are descriptions from DBpedia, is the Incremental Algorithm, is the new algorithm that always selects three properties, is the new algorithm that uses document retrieval as a termination heuristic and are descriptions produced by a native English speaker.
Counts of correctly and incorrectly identified referents.
| Correct identification | 68 | 58 | 89 | 100 | 180 | 495 |
| Incorrect identification | 167 | 161 | 126 | 114 | 40 | 608 |
| Total | 235 | 219 | 215 | 214 | 220 | 1103 |
| Proportion correct | 0.29 | 0.26 | 0.41 | 0.47 | 0.82 | 0.45 |
| Proportion incorrect | 0.71 | 0.74 | 0.59 | 0.53 | 0.18 | 0.55 |
| Correct/incorrect | 0.41 | 0.36 | 0.71 | 0.88 | 4.50 | 0.81 |
The table also shows the proportions as well as the ration of correctly and incorrectly identified referents.
Counts of correctly and incorrectly identified referents when descriptions that contained a clue as to the identity of the referent were removed.
| Correct identification | 44 | 41 | 49 | 63 | 123 | 320 |
| Incorrect identification | 112 | 110 | 97 | 86 | 32 | 437 |
| Total | 156 | 151 | 146 | 149 | 155 | 757 |
| Proportion correct | 0.28 | 0.27 | 0.34 | 0.42 | 0.79 | 0.42 |
| Proportion incorrect | 0.72 | 0.73 | 0.66 | 0.58 | 0.21 | 0.58 |
| Correct/incorrect | 0.39 | 0.37 | 0.51 | 0.73 | 3.84 | 0.73 |
The table also shows the proportions as well as the ratio of correctly and incorrectly identified referents.
Homogeneous subsets for counts of .
| A | 123 | 155 | |||
| B | 63 | 149 | |||
| B | C | 49 | 146 | ||
| C | 44 | 156 | |||
| C | 41 | 151 |
Algorithms that do not share a letter are significantly different with p < 0.05.
Mean ratings and standard deviations for .
| Mean quality | 43.570 | 57.857 | 67.600 | 66.786 | 77.552 |
| Quality SD | 27.385 | 20.630 | 16.670 | 18.051 | 15.570 |
| Mean naturalness | 61.953 | 61.927 | 62.110 | 61.495 | 70.311 |
| Naturalness SD | 18.587 | 17.294 | 17.758 | 18.655 | 15.556 |
Quality refers to the statement: “Suppose you did not know this person, how good would you find the description?” and naturalness refers to the statement: “How natural does the description read to you?”
Homogeneous subsets for .
| A | 77.6 | 15.6 | ||||
| B | 66.8 | 18.1 | ||||
| B | 67.6 | 16.7 | ||||
| C | 57.9 | 20.6 | ||||
| D | 43.6 | 27.4 |
Algorithms that do not share a letter are significantly different at p < 0.05.
Homogeneous subsets for .
| A | 70.3 | 15.6 | ||
| B | 61.5 | 18.7 | ||
| B | 62.1 | 17.8 | ||
| B | 61.9 | 17.3 | ||
| B | 62.0 | 18.6 |
Algorithms that do not share a letter are significantly different at p < 0.05
| Lion | Kenya | 102 kg | Paws, teeth | |
| Lion | China | 100 kg | Paws | |
| Tiger | China | 310 kg | Back |