| Literature DB >> 35787184 |
Jordan Kodner1, Spencer Caplan2, Charles Yang3.
Abstract
Entities:
Mesh:
Year: 2022 PMID: 35787184 PMCID: PMC9303892 DOI: 10.1073/pnas.2204664119
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 12.779
Fig. 1.The apparent success of an inadequate model: Performance of a trigram model on natural and artificial languages from the published literature; see figure 2 of ref. 1 for references. Based on YP’s evaluation scheme, the trigram model performs well on several of the languages, even outperforming YP on some (e.g., ref. 6). “Total P” measures performance of the model against the entire string set rather than just the top 25, which more accurately corresponds to grammar learning. This is clearest for Saffran, which was designed to be learnable by a bigram model. The n-gram model learns it perfectly but is unfairly penalized under YP’s evaluation because it generates strings that are in the language but outside the 25 most probable ones. All language data are from https://github.com/piantado/Fleet.