| Literature DB >> 23986735 |
Abstract
Community structure, which refers to the presence of densely connected groups within a larger network, is a common feature of several real-world networks from a variety of domains such as the human brain, social networks of hunter-gatherers and business organizations, and the World Wide Web (Porter et al., 2009). Using a community detection technique known as the Louvain optimization method, 17 communities were extracted from the giant component of the phonological network described in Vitevitch (2008). Additional analyses comparing the lexical and phonological characteristics of words in these communities against words in randomly generated communities revealed several novel discoveries. Larger communities tend to consist of short, frequent words of high degree and low age of acquisition ratings, and smaller communities tend to consist of longer, less frequent words of low degree and high age of acquisition ratings. Real communities also contained fewer different phonological segments compared to random communities, although the number of occurrences of phonological segments found in real communities was much higher than that of the same phonological segments in random communities. Interestingly, the observation that relatively few biphones occur very frequently and a large number of biphones occur rarely within communities mirrors the pattern of the overall frequency of words in a language (Zipf, 1935). The present findings have important implications for understanding the dynamics of activation spread among words in the phonological network that are relevant to lexical processing, as well as understanding the mechanisms that underlie language acquisition and the evolution of language.Entities:
Keywords: community structure; language acquisition; language evolution; lexical processing; mental lexicon; network science; phonology
Year: 2013 PMID: 23986735 PMCID: PMC3753538 DOI: 10.3389/fpsyg.2013.00553
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Community sizes for 17 communities extracted from the phonological network.
| 1 | 31 |
| 2 | 37 |
| 3 | 38 |
| 4 | 85 |
| 5 | 127 |
| 6 | 271 |
| 7 | 278 |
| 8 | 348 |
| 9 | 397 |
| 10 | 520 |
| 11 | 543 |
| 12 | 544 |
| 13 | 625 |
| 14 | 626 |
| 15 | 654 |
| 16 | 687 |
| 17 | 697 |
| MEAN | 382.82 |
| SD | 249.29 |
Summary of descriptive statistics for lexical characteristics of each real and random community.
| 1 | 4.452 | 0.961 | 5.634 | 1.475 | 1.576 | 0.617 | 2.258 | 1.094 | 1.687 | 0.433 | 0.04406 | 0.01522 | 0.00471 | 0.00236 | 10.533 | 2.818 |
| 2 | 4.946 | 0.780 | 5.289 | 1.808 | 1.489 | 0.756 | 2.649 | 1.549 | 1.373 | 0.422 | 0.05677 | 0.00902 | 0.00648 | 0.00168 | 10.393 | 3.227 |
| 3 | 4.342 | 0.745 | 4.818 | 1.960 | 1.207 | 0.322 | 2.342 | 1.146 | 1.230 | 0.236 | 0.04527 | 0.01288 | 0.00354 | 0.00216 | 11.781 | 2.378 |
| 4 | 5.129 | 0.985 | 6.132 | 1.190 | 1.583 | 0.675 | 2.918 | 1.642 | 1.673 | 0.562 | 0.05092 | 0.01014 | 0.00930 | 0.00340 | 10.538 | 3.032 |
| 5 | 4.654 | 0.938 | 5.689 | 1.531 | 1.442 | 0.549 | 2.984 | 1.830 | 1.514 | 0.434 | 0.06086 | 0.00991 | 0.00638 | 0.00185 | 10.597 | 3.063 |
| 6 | 3.686 | 0.799 | 6.093 | 1.387 | 1.712 | 0.848 | 10.635 | 8.669 | 1.830 | 0.408 | 0.03885 | 0.01346 | 0.00273 | 0.00217 | 8.898 | 3.112 |
| 7 | 3.878 | 0.823 | 6.210 | 1.286 | 1.677 | 0.828 | 10.227 | 7.770 | 1.722 | 0.405 | 0.04359 | 0.01075 | 0.00309 | 0.00152 | 8.215 | 2.814 |
| 8 | 4.422 | 0.787 | 6.014 | 1.422 | 1.644 | 0.701 | 6.017 | 4.751 | 1.706 | 0.421 | 0.04662 | 0.00993 | 0.00405 | 0.00168 | 9.374 | 3.073 |
| 9 | 4.128 | 0.972 | 5.875 | 1.562 | 1.857 | 0.866 | 8.814 | 7.783 | 1.989 | 0.538 | 0.05455 | 0.01470 | 0.00545 | 0.00285 | 9.028 | 3.389 |
| 10 | 4.123 | 0.865 | 5.810 | 1.539 | 1.629 | 0.791 | 7.656 | 6.677 | 1.695 | 0.461 | 0.04627 | 0.01218 | 0.00426 | 0.00271 | 9.347 | 3.100 |
| 11 | 3.825 | 0.855 | 5.956 | 1.512 | 1.729 | 0.792 | 11.449 | 9.546 | 1.815 | 0.429 | 0.04384 | 0.01450 | 0.00288 | 0.00214 | 8.878 | 3.414 |
| 12 | 4.268 | 1.029 | 5.915 | 1.485 | 1.688 | 0.769 | 5.044 | 4.436 | 1.787 | 0.498 | 0.05089 | 0.01668 | 0.00553 | 0.00348 | 9.697 | 3.129 |
| 13 | 4.160 | 0.959 | 6.013 | 1.415 | 1.711 | 0.842 | 9.483 | 8.449 | 1.795 | 0.479 | 0.05206 | 0.01299 | 0.00526 | 0.00271 | 8.953 | 3.118 |
| 14 | 3.682 | 0.928 | 5.956 | 1.425 | 1.865 | 0.918 | 11.229 | 8.763 | 1.948 | 0.476 | 0.04164 | 0.01355 | 0.00268 | 0.00192 | 9.091 | 3.341 |
| 15 | 4.142 | 0.854 | 6.060 | 1.351 | 1.663 | 0.792 | 9.096 | 8.236 | 1.744 | 0.457 | 0.04861 | 0.01490 | 0.00449 | 0.00291 | 8.847 | 3.080 |
| 16 | 3.905 | 0.903 | 6.072 | 1.407 | 1.860 | 0.897 | 10.531 | 9.121 | 1.944 | 0.493 | 0.04282 | 0.01468 | 0.00290 | 0.00229 | 8.829 | 3.311 |
| 17 | 4.022 | 0.891 | 6.006 | 1.412 | 1.827 | 0.852 | 11.389 | 9.456 | 1.953 | 0.469 | 0.04730 | 0.01216 | 0.00396 | 0.00259 | 8.944 | 3.098 |
| Overall | 4.058 | 0.937 | 5.974 | 1.448 | 1.734 | 0.828 | 9.100 | 8.289 | 1.822 | 0.483 | 0.04704 | 0.01430 | 0.00411 | 0.00281 | 9.106 | 3.206 |
| 1 | 4.161 | 0.820 | 5.489 | 1.749 | 1.782 | 0.967 | 7.032 | 5.666 | 1.774 | 0.432 | 0.04303 | 0.01659 | 0.00342 | 0.00287 | 10.576 | 3.374 |
| 2 | 3.946 | 0.575 | 5.773 | 1.659 | 1.697 | 0.905 | 9.703 | 8.553 | 1.705 | 0.375 | 0.04960 | 0.01334 | 0.00397 | 0.00207 | 8.370 | 2.661 |
| 3 | 4.079 | 0.997 | 5.794 | 1.590 | 1.737 | 0.769 | 9.184 | 8.577 | 1.925 | 0.604 | 0.05061 | 0.01329 | 0.00426 | 0.00304 | 8.813 | 3.615 |
| 4 | 4.012 | 0.893 | 6.163 | 1.342 | 1.733 | 0.900 | 8.129 | 7.773 | 1.873 | 0.567 | 0.04393 | 0.01509 | 0.00391 | 0.00270 | 8.724 | 3.056 |
| 5 | 4.213 | 0.879 | 5.875 | 1.499 | 1.695 | 0.804 | 6.906 | 6.560 | 1.855 | 0.500 | 0.04981 | 0.01342 | 0.00472 | 0.00281 | 9.427 | 3.091 |
| 6 | 3.993 | 1.004 | 5.965 | 1.454 | 1.711 | 0.824 | 9.745 | 8.767 | 1.838 | 0.468 | 0.04651 | 0.01421 | 0.00391 | 0.00251 | 8.877 | 3.006 |
| 7 | 4.047 | 0.984 | 5.914 | 1.452 | 1.738 | 0.849 | 8.727 | 8.101 | 1.848 | 0.483 | 0.04706 | 0.01462 | 0.00419 | 0.00290 | 9.129 | 3.329 |
| 8 | 4.126 | 0.940 | 5.885 | 1.486 | 1.674 | 0.812 | 8.580 | 8.053 | 1.797 | 0.488 | 0.04714 | 0.01464 | 0.00427 | 0.00283 | 9.031 | 3.198 |
| 9 | 4.058 | 0.904 | 5.956 | 1.469 | 1.727 | 0.846 | 9.164 | 8.443 | 1.812 | 0.508 | 0.04742 | 0.01402 | 0.00404 | 0.00274 | 8.948 | 3.211 |
| 10 | 4.071 | 0.960 | 6.007 | 1.416 | 1.751 | 0.872 | 9.269 | 8.397 | 1.814 | 0.482 | 0.04755 | 0.01483 | 0.00432 | 0.00288 | 9.166 | 3.221 |
| 11 | 4.064 | 0.965 | 5.978 | 1.460 | 1.860 | 0.907 | 9.637 | 8.710 | 1.832 | 0.492 | 0.04821 | 0.01441 | 0.00421 | 0.00293 | 8.917 | 3.278 |
| 12 | 4.053 | 0.926 | 5.924 | 1.465 | 1.692 | 0.774 | 9.158 | 8.286 | 1.778 | 0.442 | 0.04705 | 0.01414 | 0.00404 | 0.00277 | 9.142 | 3.122 |
| 13 | 4.080 | 0.913 | 5.998 | 1.477 | 1.719 | 0.793 | 9.421 | 8.421 | 1.853 | 0.478 | 0.04729 | 0.01399 | 0.00409 | 0.00282 | 9.071 | 3.102 |
| 14 | 4.061 | 0.927 | 5.979 | 1.451 | 1.720 | 0.810 | 8.941 | 8.137 | 1.801 | 0.477 | 0.04703 | 0.01445 | 0.00412 | 0.00278 | 9.178 | 3.178 |
| 15 | 4.034 | 0.964 | 6.045 | 1.386 | 1.761 | 0.824 | 8.821 | 8.088 | 1.850 | 0.486 | 0.04562 | 0.01451 | 0.00400 | 0.00284 | 9.219 | 3.249 |
| 16 | 4.051 | 0.920 | 5.993 | 1.450 | 1.728 | 0.824 | 9.403 | 8.362 | 1.810 | 0.474 | 0.04739 | 0.01424 | 0.00419 | 0.00303 | 9.129 | 3.304 |
| 17 | 4.030 | 0.938 | 5.998 | 1.413 | 1.713 | 0.799 | 9.019 | 8.287 | 1.826 | 0.494 | 0.04612 | 0.01372 | 0.00389 | 0.00255 | 9.159 | 3.238 |
| Overall | 4.058 | 0.937 | 5.974 | 1.448 | 1.734 | 0.828 | 9.100 | 8.289 | 1.822 | 0.483 | 0.04704 | 0.01430 | 0.00411 | 0.00281 | 9.106 | 3.206 |
Summary of statistical analyses for real and random communities.
| Word length | ||
| Familiarity | ||
| Word frequency | ||
| Neighborhood density | ||
| Neighborhood frequency | ||
| Positional probability | ||
| Biphone probability | ||
| Age of acquisition | ||
| Word length | ||
| Familiarity | ||
| Word frequency | ||
| Neighborhood density | ||
| Neighborhood frequency | ||
| Positional probability | ||
| Biphone probability | ||
| Age of acquisition | ||
(1) Corrected F-tests were conducted using corrected degrees of freedom if Levene's test of homogeneity of variances was significant. (2) The linear trend post-hoc contrast was conducted only if the omnibus F-test was statistically significant.
Figure 1Plots of mean lexical characteristics of each community against community sizes. The x-axis represents the number of words residing in each community. The y-axis represents the mean lexical characteristics for each of the 17 communities. The dashed line represents the best-fit line.
Summary of Kolmogorov-Smirnov tests for raw biphone counts of real and random communities.
| 1 | 0.366 | 0.001 |
| 2 | 0.288 | 0.001 |
| 3 | 0.287 | 0.007 |
| 4 | 0.296 | <0.001 |
| 5 | 0.309 | <0.001 |
| 6 | 0.175 | 0.002 |
| 7 | 0.174 | 0.001 |
| 8 | 0.136 | 0.005 |
| 9 | 0.149 | 0.002 |
| 10 | 0.107 | 0.025 |
| 11 | 0.077 | 0.191 |
| 12 | 0.119 | 0.004 |
| 13 | 0.127 | 0.004 |
| 14 | 0.066 | 0.295 |
| 15 | 0.127 | 0.002 |
| 16 | 0.086 | 0.077 |
| 17 | 0.071 | 0.202 |
p < 0.001,
p < 0.01,
p < 0.05,
p < 0.10.
Figure 2Raw biphone counts of real and random community 1. The x-axis represents the different biphones found within these communities and the biphones (on both x-axes) were arranged based on their frequency of occurrence in the real community in descending order.
Figure 3Raw biphone counts of real and random community 15. The x-axis represents the different biphones found within these communities and the biphones (on both x-axes) were arranged based on their frequency of occurrence in the real community in descending order.