| Literature DB >> 26384237 |
Heng Chen1, Junying Liang2, Haitao Liu3.
Abstract
We demonstrate a substantial evidence that the word length can be an essential lexical structural feature for word evolution in written Chinese. The data used in this study are diachronic Chinese short narrative texts with a time span of over 2000-years. We show that the increase of word length is an essential regularity in word evolution. On the one hand, word frequency is found to depend on word length, and their relation is in line with the Power law function y = ax-b. On the other hand, our deeper analyses show that the increase of word length results in the simplification in characters for balance in written Chinese. Moreover, the correspondence between written and spoken Chinese is discussed. We conclude that the disyllabic trend may account for the increase of word length, and its impacts can be explained in "the principle of least effort".Entities:
Mesh:
Year: 2015 PMID: 26384237 PMCID: PMC4575206 DOI: 10.1371/journal.pone.0138567
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Static Word length distributions of each time period with different texts scales (N = 1000, N = 2000 and N = 3000 characters; statistics for the sample in each time period can be found in S1 File).
Fig 2Linear fittings to static word probability changes of each word length class for texts with N = 1000.
Fig 3Dynamic Word length distributions of each time period with different texts scales.
Fig 4Dynamic and static mean word length evolution for different text scales.
Fig 5Fitting y = ax-b to the relation between word length and type-token ratio for each time period with N = 10000.