| Literature DB >> 35529579 |
Felix Haiduk1, W Tecumseh Fitch1,2.
Abstract
Music and spoken language share certain characteristics: both consist of sequences of acoustic elements that are combinatorically combined, and these elements partition the same continuous acoustic dimensions (frequency, formant space and duration). However, the resulting categories differ sharply: scale tones and note durations of small integer ratios appear in music, while speech uses phonemes, lexical tone, and non-isochronous durations. Why did music and language diverge into the two systems we have today, differing in these specific features? We propose a framework based on information theory and a reverse-engineering perspective, suggesting that design features of music and language are a response to their differential deployment along three different continuous dimensions. These include the familiar propositional-aesthetic ('goal') and repetitive-novel ('novelty') dimensions, and a dialogic-choric ('interactivity') dimension that is our focus here. Specifically, we hypothesize that music exhibits specializations enhancing coherent production by several individuals concurrently-the 'choric' context. In contrast, language is specialized for exchange in tightly coordinated turn-taking-'dialogic' contexts. We examine the evidence for our framework, both from humans and non-human animals, and conclude that many proposed design features of music and language follow naturally from their use in distinct dialogic and choric communicative contexts. Furthermore, the hybrid nature of intermediate systems like poetry, chant, or solo lament follows from their deployment in the less typical interactive context.Entities:
Keywords: animal communication; choric; dialogic; information theory; language; music
Year: 2022 PMID: 35529579 PMCID: PMC9075586 DOI: 10.3389/fpsyg.2022.786899
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Design features differing between language and music, updated from Fitch (2006).
| Design Feature | Language | Music | Definition | ||
|---|---|---|---|---|---|
|
|
|
|
| ||
| Vocal auditory channel | + | − | + | − | Signal sequences are patterns of sounds produced by the vocal tract and articulators |
| Broadcast transmission | + | +? | + | + | Signal sequences are detectable by anyone within given distance/line of sight |
| Rapid fading | + | + | + | + | Signal sequences dissipate when signalling stops |
| Interchangeability | + | + | + | − | Individuals can be both sender and receiver |
| Total feedback | + | + | + | +? | Senders themselves perceive what they signal |
| Specialisation | + | + | + | + | A signal sequence does not directly trigger a specific behaviour in the receiver |
| Productivity | + | + | + | + | Ability to produce novel signal sequences |
| Discreteness | + | + | + | + | Signalling units are functionally distinct |
| Cultural transmission | + | + | + | + | The signalling system is transmitted between individuals |
| Movement | + | + | + | + | Movements of body (−parts) accompany movements that create the signal itself |
| Transposability | + | + | + | + | The relationships between signal units rather than absolute features identify a signal sequence (a sentence is considered the same regardless of who spoke/signed it, a melody regardless of instrument, voice or absolute pitch) |
| Duality of Patterning | + | + | − | − | Signal sequences can be analysed both as units of signalling (cenemes) and meaning-bearing units (pleremes) |
| Generativity | + | + | + | + | Signal units are recombined according to rules |
| Semanticity | + | + | − | − | Fixed associations exist between meaning-bearing units and states or properties of the world/environment |
| Arbitrariness | + | + | − | − | The content of most meaning-bearing units is unrelated to features of signalling units |
| Displacement | + | + | − | − | Meaning-bearing units refer to entities outside their spatial and temporal context |
| Discrete pitches | − | − | + | + | Allowed pitches are based on a scale of tones related by intervals |
| Isochronic | − | − | + | + | Regular periodic pulse providing a reference framework for other temporal features of the signal sequence |
| Performative context | − | − | + | + | Classes of signal sequences (e.g. songs or styles) recur in specific social contexts |
| Repeatable (repertoire) | − | − | + | + | Signal sequences are distinguishable (pieces), exactly repeatable and repeated in certain contexts |
| A-referentially expressive | − | +? | + | + | Higher order relations of a signal sequence are cognitively mapped to movement and affective responses |
These design features concern speech (including sign) or musical acts that we label as ‘typical’, e.g., spoken conversations or musical ensemble playing. V = vocal, S = signed, I = instrumental.
Sensorimotor.
Added by the authors.
Overview of the three proposed dimensions of our framework, with examples from music, language, and animal communication.
| Dimension | Pole 1 | Pole 2 | ||
|---|---|---|---|---|
| Name | Example | Name | Example | |
| Goal | Propositional | Discussing the week’s events with a friend Singing ‘Happy Birthday’ | Aesthetic | Shakespeare sonnetsListening to your favourite Beatles album |
| Novelty | Novelty | Listening to a conference talkVariation and recombination of melodic modules in BaAka music ( | Repetition | Word repetition for emphasis (‘I did not break the dish. I did not break the dish. I repeat, I did not break the dish’)Choruses in songs |
| Interactivity | Choric | Religious ensemble chantingEnsemble musicPlain-tailed wren mating display (within sex) | Dialogic | Conversational speechCall-and-response songAnimal antiphonal calling |
Assumptions and measures of information theory.
| Assumption | Measure/method | References |
|---|---|---|
| Information is an adequate model of prediction, plausible to happen in the brain | Predictive coding and similar accounts | |
| Entropy and information can be measured at multiple levels of the signal sequence concurrently, and their interaction can be modelled | Models based on statistical learning and using a multiple viewpoint approach | |
| The information/entropy trajectories of the different levels can be compared | Mutual information measures for multivariate time series (transfer entropy, partial information decomposition, etc.) | |
| Context (e.g., discourse context, conceptual knowledge, etc.) can be modelled using information theory | Conditional entropy (e.g., with | |
| Information theory can be applied to both discrete or continuous (or discretisable) sequences, e.g., for body movement and gesturing | Discretisation of continuous signals |