Literature DB >> 30283378

Long-Range Correlation Underlying Childhood Language and Generative Models.

Kumiko Tanaka-Ishii1.   

Abstract

Long-range correlation, a property of time series exhibiting relevant statistical dependence between two distant subsequences, is mainly studied in the statistical physics domain and has been reported to exist in natural language. By using a state-of-the-art method for such analysis, long-range correlation is first shown to occur in long CHILDES data sets. To understand why, generative stochastic models of language, originally proposed in the cognitive scientific domain, are investigated. Among representative models, the Simon model is found to exhibit surprisingly good long-range correlation, but not the Pitman-Yor model. Because the Simon model is known not to correctly reflect the vocabulary growth of natural languages, a simple new model is devised as a conjunct of the Simon and Pitman-Yor models, such that long-range correlation holds with a correct vocabulary growth rate. The investigation overall suggests that uniform sampling is one cause of long-range correlation and could thus have some relation with actual linguistic processes.

Entities:  

Keywords:  CHILDES; Pitman-Yor model; Simon Model; fluctuation analysis; generative models; long-range correlation

Year:  2018        PMID: 30283378      PMCID: PMC6157415          DOI: 10.3389/fpsyg.2018.01725

Source DB:  PubMed          Journal:  Front Psychol        ISSN: 1664-1078


  15 in total

1.  Emergence of scaling in random networks

Authors: 
Journal:  Science       Date:  1999-10-15       Impact factor: 47.728

2.  Reexamining the vocabulary spurt.

Authors:  Jennifer Ganger; Michael R Brent
Journal:  Dev Psychol       Date:  2004-07

3.  Renormalization-group transformations and correlations of seismicity.

Authors:  Alvaro Corral
Journal:  Phys Rev Lett       Date:  2005-07-07       Impact factor: 9.161

4.  Effect of nonlinear correlations on the statistics of return intervals in multifractal data sets.

Authors:  Mikhail I Bogachev; Jan F Eichner; Armin Bunde
Journal:  Phys Rev Lett       Date:  2007-12-10       Impact factor: 9.161

5.  A Bayesian framework for word segmentation: exploring the effects of context.

Authors:  Sharon Goldwater; Thomas L Griffiths; Mark Johnson
Journal:  Cognition       Date:  2009-05-05

6.  Long-Range Memory in Literary Texts: On the Universal Clustering of the Rare Words.

Authors:  Kumiko Tanaka-Ishii; Armin Bunde
Journal:  PLoS One       Date:  2016-11-28       Impact factor: 3.240

7.  Modeling statistical properties of written text.

Authors:  M Angeles Serrano; Alessandro Flammini; Filippo Menczer
Journal:  PLoS One       Date:  2009-04-29       Impact factor: 3.240

8.  Beyond word frequency: bursts, lulls, and scaling in the temporal distributions of words.

Authors:  Eduardo G Altmann; Janet B Pierrehumbert; Adilson E Motter
Journal:  PLoS One       Date:  2009-11-11       Impact factor: 3.240

9.  The evolution of the exponent of Zipf's law in language ontogeny.

Authors:  Jaume Baixeries; Brita Elvevåg; Ramon Ferrer-i-Cancho
Journal:  PLoS One       Date:  2013-03-13       Impact factor: 3.240

10.  Long-range correlation properties in timing of skilled piano performance: the influence of auditory feedback and deep brain stimulation.

Authors:  María Herrojo Ruiz; Sang Bin Hong; Holger Hennig; Eckart Altenmüller; Andrea A Kühn
Journal:  Front Psychol       Date:  2014-09-25
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.