| Literature DB >> 23658625 |
Yair Neuman1, Dan Assaf, Yohai Cohen, Mark Last, Shlomo Argamon, Newton Howard, Ophir Frieder.
Abstract
Identifying metaphorical language-use (e.g., sweet child) is one of the challenges facing natural language processing. This paper describes three novel algorithms for automatic metaphor identification. The algorithms are variations of the same core algorithm. We evaluate the algorithms on two corpora of Reuters and the New York Times articles. The paper presents the most comprehensive study of metaphor identification in terms of scope of metaphorical phrases and annotated corpora size. Algorithms' performance in identifying linguistic phrases as metaphorical or literal has been compared to human judgment. Overall, the algorithms outperform the state-of-the-art algorithm with 71% precision and 27% averaged improvement in prediction over the base-rate of metaphors in the corpus.Entities:
Mesh:
Year: 2013 PMID: 23658625 PMCID: PMC3639214 DOI: 10.1371/journal.pone.0062343
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The Reuters Corpus – Number of Annotated Expressions.
| Expression Type | |||
| Target word | I | II | III |
| Father | 17 | 116 | 62 |
| God | 5 | 19 | 13 |
| Governance | 0 | 5 | 13 |
| Government | 86 | 277 | 613 |
| Mother | 12 | 81 | 59 |
|
|
|
|
|
Annotators’ Decision – Reuters.
| Expression Type | |||
| Annotators’ decision | I | II | III |
| Literal | 40 (33.3%) | 242 (48.6%) | 576 (75.8%) |
| Metaphorical | 80 (66.7%) | 256 (51.4%) | 184 (24.2%) |
Annotators’ Decision – NYT.
| Expression Type | |||
| Annotators’ decision | I | II | III |
| Literal | 129 (47.6%) | 189 (63%) | 272 (63%) |
| Metaphorical | 142 (52.4%) | 111 (37%) | 160 (37%) |
Results – Reuters.
| Expression type | A priori probability | Precision | 95% CI | Recall |
| I | 66.7% | 83.9% | 75–90% | 97.5% |
| II | 51.4% | 76.1% | 70–80% | 82% |
| III | 24.2% | 54.4% | 45–61% | 43.5% |
Comparative Results – Reuters.
| Expression type | Precision CCO | Precision Con-Abs | Recall CCO | Recall Con-Abs |
| I | 83.9% | 76.53% | 97.5% | 76.53% |
| II | 76.1% | 63.87 | 82% | 67.2% |
Results – NYT.
| Expression type | A priori probability | Precision | 95% CI | Recall |
| I | 52.4% | 84.1% | 77–89% | 85.9% |
| II | 37% | 62% | 54–69% | 83.8% |
| III | 37% | 69.8% | 63–75% | 88.1% |
Comparative Results – NYT.
| Metaphor type | Precision CCO | Precision Con-Abs | Recall CCO | Recall Con-Abs |
| I | 84.1% | 65.6% | 85.9% | 72.5% |
| II | 62% | 50% | 83.8% | 5% |