| Literature DB >> 29489837 |
Heng Chen1, Xinying Chen2, Haitao Liu1,3,4.
Abstract
Language is a complex adaptive system, but how does it change? For investigating this process, four diachronic Chinese word co-occurrence networks have been built based on texts that were written during the last 2,000 years. By comparing the network indicators that are associated with the hierarchical features in language networks, we learn that the hierarchy of Chinese lexical networks has indeed evolved over time at three different levels. The connections of words at the micro level are continually weakening; the number of words in the meso-level communities has increased significantly; and the network is expanding at the macro level. This means that more and more words tend to be connected to medium-central words and form different communities. Meanwhile, fewer high-central words link these communities into a highly efficient small-world network. Understanding this process may be crucial for understanding the increasing structural complexity of the language system.Entities:
Mesh:
Year: 2018 PMID: 29489837 PMCID: PMC5830315 DOI: 10.1371/journal.pone.0192545
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Selected texts in the corpus.
| Corpus information | Corpus 1 | Corpus 2 | Corpus 3 | Corpus 4 |
|---|---|---|---|---|
| Ancient | Middle Ancient | Modern Times | Modern | |
| Name of the text | 孟子 | 颜氏家训书 | 宋元白话小说 | 新闻联播 |
| Size (words) | 10,000 | 10,000 | 10,000 | 10,000 |
| Total of word types | 1,162 | 1,945 | 1,421 | 1,370 |
| Total of characters | 1,165 | 12,640 | 13,457 | 17,854 |
| Time Period | B.C. 5–B.C. 3 | A.D. 4–A.D. 6 | A.D. 12–A.D. 14 | A.D. 21 |
Fig 1A word-form co-occurrence network based on an English paragraph.
Diachronic change of network parameters.
| Parameters | Network 1 | Network 2 | Network 3 | Network 4 |
|---|---|---|---|---|
| 1,527 | 3,034 | 2,227 | 2,526 | |
| 6.6051 | 4.2149 | 5.1612 | 4.7403 | |
| 6.6064 | 4.2373 | 5.1863 | 4.7593 | |
| 0.0022 | 0.0007 | 0.0012 | 0.0020 | |
| 0.0043 | 0.0014 | 0.0023 | 0.0019 | |
| 3.6553 | 4.2421 | 4.3153 | 4.7460 | |
| 4.0943 | 5.6825 | 4.8679 | 5.1763 | |
| 9 | 13 | 14 | 16 | |
| 7 | 13 | 9 | 10 | |
| 0.1542 | 0.0716 | 0.0681 | 0.0563 | |
| 0.0061 | 0.0016 | 0.0028 | 0.0013 | |
| 0.246 | 0.163 | 0.107 | 0.199 | |
| 0.0093 | 0.0109 | 0.0131 | 0.0088 | |
| 0.3481 | 0.4846 | 0.4345 | 0.4832 | |
| 0.3935 | 0.5152 | 0.4522 | 0.4768 | |
| 35.9567 | 57.6239 | 27.5611 | 28.2322 | |
| -1.173 | -1.270 | -1.353 | -1.449 | |
| 0.775 | 0.811 | 0.851 | 0.824 | |
| -0.418 | -0.396 | -0.185 | -0.355 | |
| 0.835 | 0.729 | 0.566 | 0.661 | |
| 0.956 | 0.454 | 0.304 | 0.292 | |
| 5.797 | 3.966 | 5.652 | 4.229 | |
| -0.609 | -0.576 | -0.375 | -0.543 | |
| -2.543 | -2.262 | -2.5 | -2.317 | |
| 0.803 | 0.688 | 0.564 | 0.571 | |
| 0.9951 | 0.9996 | 0.9996 |
Fig 2Power-law fitting (dual-log) of degree distributions of four diachronic Chinese networks.
Fig 3Power-law fitting (dual-log) of average degree of nearest neighbors of four diachronic Chinese networks.
Fig 4Evolution of the clustering coefficient (
Fig 5Power-law fitting of the degree-dependent clustering coefficients of four diachronic Chinese networks.
Fig 6The means of the top 10, the middle 10, and the final 10 degree-dependent clustering coefficients.
Fig 7MUL distributions (TP1 corresponds to Network 1 text).
The diachronic change of the MUL of Chinese (measured based on words) and the MWL of Chinese (measured based on characters).
| TP 1 | TP 2 | TP 3 | TP 4 | |
|---|---|---|---|---|
| MUL | 4.07 | 3.82 | 4.74 | 5.55 |
| MWL(static) | 1.17 | 1.26 | 1.35 | 1.79 |
| MWL(dynamic) | 1.48 | 1.66 | 1.71 | 2.01 |