| Literature DB >> 26235596 |
Justine T Kao1, Roger Levy2, Noah D Goodman1.
Abstract
Humor plays an essential role in human interactions. Precisely what makes something funny, however, remains elusive. While research on natural language understanding has made significant advancements in recent years, there has been little direct integration of humor research with computational models of language understanding. In this paper, we propose two information-theoretic measures-ambiguity and distinctiveness-derived from a simple model of sentence processing. We test these measures on a set of puns and regular sentences and show that they correlate significantly with human judgments of funniness. Moreover, within a set of puns, the distinctiveness measure distinguishes exceptionally funny puns from mediocre ones. Our work is the first, to our knowledge, to integrate a computational model of general language understanding and humor theory to quantitatively predict humor at a fine-grained level. We present it as an example of a framework for applying models of language processing to understand higher level linguistic and cognitive phenomena.Entities:
Keywords: Ambiguity; Computational modeling; Humor; Language processing; Noisy channel
Mesh:
Year: 2015 PMID: 26235596 PMCID: PMC5042108 DOI: 10.1111/cogs.12269
Source DB: PubMed Journal: Cogn Sci ISSN: 0364-0213
Figure 1Graphical representation of a generative model of a sentence. If the indicator variable f has value 1, w is generated based on semantic relatedness to the sentence meaning m; otherwise, w is sampled from a trigram language model based on the immediately preceding two words.
Example sentence from each category. Identical homophone sentences contain phonetically ambiguous words that have identical homophones; near homophone sentences contain phonetically ambiguous words that have near homophones. Pun sentences were selected from a pun website; non‐pun sentences were selected from an online dictionary (see main text for details)
| Homophone | Type | Example |
|---|---|---|
| Identical | Pun | The magician was so mad he pulled his hare out. |
| Identical | Non‐pun | The hare ran rapidly across the field. |
| Identical | Non‐pun | Some people have lots of hair on their heads. |
| Near | Pun | A dentist has to tell a patient the whole tooth. |
| Near | Non‐pun | A dentist examines one tooth at a time. |
| Near | Non‐pun | She always speaks the truth. |
Figure 2Standard error ellipses of ambiguity and distinctiveness for each sentence type. Puns (both identical and near homophone) score higher on ambiguity and distinctiveness; non‐pun sentences score lower.
Regression coefficients using ambiguity and distinctiveness to predict funniness ratings for all 435 sentences; p‐values are computed assuming that the t statistic is approximately normally distributed
| Estimate |
|
| |
|---|---|---|---|
| Intercept | −2.139 | 0.306 | <.0001 |
| Ambiguity | 1.915 | 0.221 | <.0001 |
| Distinctiveness | 0.264 | 0.040 | <.0001 |
Figure 3Average funniness ratings and distinctiveness of 145 pun sentences binned according to distinctiveness quartiles. Error bars are confidence intervals.
Semantically relevant words, ambiguity/distinctiveness scores, and funniness ratings for sentences from each category. Words in boldface are semantically relevant to m ; words in italics are semantically relevant to m
|
|
| Type | Sentence | Amb. | Dist. | Funni. |
|---|---|---|---|---|---|---|
|
|
| Pun | The | 0.15 | 7.87 | 1.71 |
| Non | The | 1.43E−5 | 7.25 | −0.40 | ||
|
|
| Pun | A | 0.1 | 8.48 | 1.41 |
| Non | A | 8.92E−5 | 7.65 | −0.45 |