| Literature DB >> 35413059 |
Artjoms Šeļa1,2, Petr Plecháč3, Alie Lassche4.
Abstract
Recent advances in cultural analytics and large-scale computational studies of art, literature and film often show that long-term change in the features of artistic works happens gradually. These findings suggest that conservative forces that shape creative domains might be underestimated. To this end, we provide the first large-scale formal evidence of the association between poetic meter and semantics in 18-19th century European literatures, using Czech, German and Russian collections with additional data from English poetry and early modern Dutch songs. Our study traces this association through a series of unsupervised classifications using the abstracted semantic features of poems that are inferred for individual texts with the aid of topic modeling. Topics alone enable recognition of the meters in each observed language, as may be seen from the same-meter samples clustering together (median Adjusted Rand Index between 0.48 and 1 across traditions). In addition, this study shows that the strength of the association between form and meaning tends to decrease over time. This may reflect a shift in aesthetic conventions between the 18th and 19th centuries as individual innovation was increasingly favored in literature. Despite this decline, it remains possible to recognize semantics of the meters from past or future, which suggests the continuity in meter-meaning relationships while also revealing the historical variability of conditions across languages. This paper argues that distinct metrical forms, which are often copied in a language over centuries, also maintain long-term semantic inertia in poetry. Our findings highlight the role of the formal features of cultural items in influencing the pace and shape of cultural evolution.Entities:
Mesh:
Year: 2022 PMID: 35413059 PMCID: PMC9004753 DOI: 10.1371/journal.pone.0266556
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Examples of accentual-syllabic metrical types and labeling strategies.
S—denotes a strong position in the foot (stress expected), W—weak.
| Meter | Foot | Pattern | Metrical Type | Label |
|---|---|---|---|---|
| Iamb | WS | WS|WS|WS|WS|WS | iambic pentameter | I5 |
| Trochee | SW | SW|SW|SW|S(W) | trochaic tetrameter | T4 |
| Dactyl | SWW | SWW|SWW|SWW|S(WW) | dactylic tetrameter | D4 |
| Amphibrach | WSW | WSW|WSW|WSW|WS(W) | amphibrachic tetrameter | A4 |
| Anapest | WWS | WWS|WWS | anapestic dimeter | An2 |
Fig 1Random 100-poem samples taken without replacement per meter in vector spaces defined by LDA topic models.
(A) Adjusted Rand Index of k-means clustering (whiskers give the 5th- to 95th-percentile range). 10,000 random samplings. Crosses show the ARI of the samplings presented in PCA biplots. PCA biplots of (B) Czech (8 meters), (C) German (4 meters), (D) Russian (4 meters), (E) Dutch (4 meters) and (F) English data (2 meters) respectively with eigenvectors for the 5 most contributing topics. Single random sampling.
Fig 2Accuracy of SVM classifications.
Predicting meter with vectors defined by topic probabilities in random samples of poems (sample size ∈{1, 5}∪{10, 20, …, 100}). (1) trained and tested on the entire dataset (leave-one-out cross-validation), (2) trained on earlier data and tested on later data, (3) trained on later data and tested on earlier data. 10,000 iterations. (A) Czech, (B) German, (C) Russian, (D) Dutch, (E) English.
Adjusted Rand Index of k-means clustering in different periods (random 100-poem samples).
10,000 iterations.
| Language | Time span | ARI | Meters | # of samples per meter | |
|---|---|---|---|---|---|
| mean | std. dev. | ||||
| Czech | 1800–1859 | 0.994 | 0.034 | I5, T4, T5 | 5 |
| German | 1750–1824 | 0.958 | 0.098 | I4, I5, T4 | 5 |
| Russian | 1800–1859 | 0.715 | 0.181 | I4, I5, T4 | 5 |
| Dutch | 1550–1649 | 1 | 0 | I3, I5, T4 | 4 |
Summary of poetry corpora used.
| Language | Texts | Period | Tokens |
|---|---|---|---|
| Czech | 69,760 | 18–20th c. | 13,100,898 |
| German | 53,608 | 16–20th c. | 10,462,211 |
| Russian | 17,900 | 18–19th c. | 3,329,352 |
| Dutch | 22,297 | 1550–1750 | 6,562,888 |
| English | 6,448 | 16–19th c. | 2,126,436 |