| Literature DB >> 30662738 |
Bernat Corominas-Murtra1, Martí Sànchez Fibla2, Sergi Valverde3,4, Ricard Solé3,4,5.
Abstract
The emergence of syntax during childhood is a remarkable example of how complex correlations unfold in nonlinear ways through development. In particular, rapid transitions seem to occur as children reach the age of two, which seems to separate a two-word, tree-like network of syntactic relations among words from the scale-free graphs associated with the adult, complex grammar. Here, we explore the evolution of syntax networks through language acquisition using the chromatic number, which captures the transition and provides a natural link to standard theories on syntactic structures. The data analysis is compared to a null model of network growth dynamics which is shown to display non-trivial and sensible differences. At a more general level, we observe that the chromatic classes define independent regions of the graph, and thus, can be interpreted as the footprints of incompatibility relations, somewhat as opposed to modularity considerations.Entities:
Keywords: complex networks; graph colouring; modularity; syntax
Year: 2018 PMID: 30662738 PMCID: PMC6304139 DOI: 10.1098/rsos.181286
Source DB: PubMed Journal: R Soc Open Sci ISSN: 2054-5703 Impact factor: 2.963
Figure 2.Evolution of the chromatic number χ (solid line), maximum clique ω (dashed line with crosses) and the maximum core (dashed line with squares) in Peter’s corpus (a) and Carl’s corpus (b). Evolution of the same measures over an ensemble of n = 20 networks obtained after running the null model fed with Peter’s corpus data (c) and Carl’s corpus (d). Insets: comparison between the time evolution of the deepest K-core, -core, size (dashed line)—i.e. number of nodes within this subgraph—and the corresponding to the deepest K-core (solid line) for each set of networks. Shaded grey areas correspond to standard deviation in the case of the simulated instances.
Figure 4.Relationship between the chromatic number, mean degree and giant connected component (GCC), comparing real data from the null model. (a) Scatter plot between the chromatic number and the average connectivity of the networks for Peter’s corpus and its associated null model. (b) Scatter plot between the chromatic number and the average connectivity of the networks for Carl’s corpus and its associated null model. (c) Scatter plot between the chromatic number and the size of the GCC for Peter’s corpus and its associated null model. (d) Scatter plot between the chromatic number and the size of the GCC for Carl’s corpus and its associated null model. Clearly, all indicators show that the chromatic number observed in real graphs is below that expected by a non-syntactic speaker.
Figure 1.Optimal colourings of syntactic networks before and after the syntactic spurt. (a) A syntactic network before the transition (3rd corpus) is largely bipartite (this network accepts a 2-colouring). (b) Post-transition network (7th corpus) is remarkably more complex, which corresponds to high chromatic number . All networks coming from Peter dataset. Time spent between these two corpora is about two and a half months—see text.
Relative energy values of q-colourings in the Peter (top) and Carl (bottom) datasets. Relative energies reveal the fraction of frustrated links in the optimal colouring using q different colours.
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| 0 | 0 | 0 | 1/49 | 5/105 | 66/434 | 131/644 | 87/589 | 157/903 | 104/659 | 95/717 | |
| 0 | 0 | 0 | 0 | 0 | 8/434 | 31/644 | 15/589 | 40/903 | 20/659 | 10/717 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 8/644 | 0 | 8/903 | 2/659 | 0 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 1/644 | 0 | 0 | 0 | 0 |
Figure 3.Evolution of (a) the mean degree and (b) size of the largest connected component in the real (strong solid lines) and simulated (weak solid lines) syntax networks. Shaded grey areas correspond to standard deviation in the case of the simulated instances.