Literature DB >> 16698933

Hierarchical structures induce long-range dynamical correlations in written texts.

E Alvarez-Lacalle1, B Dorow, J-P Eckmann, E Moses.   

Abstract

Thoughts and ideas are multidimensional and often concurrent, yet they can be expressed surprisingly well sequentially by the translation into language. This reduction of dimensions occurs naturally but requires memory and necessitates the existence of correlations, e.g., in written text. However, correlations in word appearance decay quickly, while previous observations of long-range correlations using random walk approaches yield little insight on memory or on semantic context. Instead, we study combinations of words that a reader is exposed to within a "window of attention," spanning about 100 words. We define a vector space of such word combinations by looking at words that co-occur within the window of attention, and analyze its structure. Singular value decomposition of the co-occurrence matrix identifies a basis whose vectors correspond to specific topics, or "concepts" that are relevant to the text. As the reader follows a text, the "vector of attention" traces out a trajectory of directions in this "concept space." We find that memory of the direction is retained over long times, forming power-law correlations. The appearance of power laws hints at the existence of an underlying hierarchical network. Indeed, imposing a hierarchy similar to that defined by volumes, chapters, paragraphs, etc. succeeds in creating correlations in a surrogate random text that are identical to those of the original text. We conclude that hierarchical structures in text serve to create long-range correlations, and use the reader's memory in reenacting some of the multidimensionality of the thoughts being expressed.

Mesh:

Year:  2006        PMID: 16698933      PMCID: PMC1472411          DOI: 10.1073/pnas.0510673103

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  7 in total

1.  Language as an evolving word web.

Authors:  S N Dorogovtsev; J F Mendes
Journal:  Proc Biol Sci       Date:  2001-12-22       Impact factor: 5.349

2.  Curvature of co-links uncovers hidden thematic layers in the World Wide Web.

Authors:  Jean-Pierre Eckmann; Elisha Moses
Journal:  Proc Natl Acad Sci U S A       Date:  2002-04-23       Impact factor: 11.205

3.  Hierarchical organization in complex networks.

Authors:  Erzsébet Ravasz; Albert-László Barabási
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2003-02-14

4.  Entropy of dialogues creates coherent structures in e-mail traffic.

Authors:  Jean-Pierre Eckmann; Elisha Moses; Danilo Sergi
Journal:  Proc Natl Acad Sci U S A       Date:  2004-09-24       Impact factor: 11.205

5.  Long-range correlations in nucleotide sequences.

Authors:  C K Peng; S V Buldyrev; A L Goldberger; S Havlin; F Sciortino; M Simons; H E Stanley
Journal:  Nature       Date:  1992-03-12       Impact factor: 49.962

6.  The small world of human language.

Authors:  R Ferrer I Cancho; R V Solé
Journal:  Proc Biol Sci       Date:  2001-11-07       Impact factor: 5.349

Review 7.  Computational and evolutionary aspects of language.

Authors:  Martin A Nowak; Natalia L Komarova; Partha Niyogi
Journal:  Nature       Date:  2002-06-06       Impact factor: 49.962

  7 in total
  13 in total

1.  On the origin of long-range correlations in texts.

Authors:  Eduardo G Altmann; Giampaolo Cristadoro; Mirko Degli Esposti
Journal:  Proc Natl Acad Sci U S A       Date:  2012-07-02       Impact factor: 11.205

2.  Fractals in the nervous system: conceptual implications for theoretical neuroscience.

Authors:  Gerhard Werner
Journal:  Front Physiol       Date:  2010-07-06       Impact factor: 4.566

3.  The dynamics of memory retrieval in hierarchical networks.

Authors:  Yifan Gu; Pulin Gong
Journal:  J Comput Neurosci       Date:  2016-02-27       Impact factor: 1.621

4.  Languages cool as they expand: allometric scaling and the decreasing need for new words.

Authors:  Alexander M Petersen; Joel N Tenenbaum; Shlomo Havlin; H Eugene Stanley; Matjaž Perc
Journal:  Sci Rep       Date:  2012-12-10       Impact factor: 4.379

5.  Universal entropy of word ordering across linguistic families.

Authors:  Marcelo A Montemurro; Damián H Zanette
Journal:  PLoS One       Date:  2011-05-13       Impact factor: 3.240

6.  Communication patterns in a psychotherapy following traumatic brain injury: a quantitative case study based on symbolic dynamics.

Authors:  Paul E Rapp; Christopher J Cellucci; Adele M K Gilpin; Miguel A Jiménez-Montaño; Kathryn E Korslund
Journal:  BMC Psychiatry       Date:  2011-07-27       Impact factor: 3.630

7.  Statistical laws governing fluctuations in word use from word birth to word death.

Authors:  Alexander M Petersen; Joel Tenenbaum; Shlomo Havlin; H Eugene Stanley
Journal:  Sci Rep       Date:  2012-03-15       Impact factor: 4.379

8.  Complexity-entropy analysis at different levels of organisation in written language.

Authors:  Ernesto Estevez-Rams; Ania Mesa-Rodriguez; Daniel Estevez-Moya
Journal:  PLoS One       Date:  2019-05-08       Impact factor: 3.240

9.  Modeling statistical properties of written text.

Authors:  M Angeles Serrano; Alessandro Flammini; Filippo Menczer
Journal:  PLoS One       Date:  2009-04-29       Impact factor: 3.240

10.  Beyond word frequency: bursts, lulls, and scaling in the temporal distributions of words.

Authors:  Eduardo G Altmann; Janet B Pierrehumbert; Adilson E Motter
Journal:  PLoS One       Date:  2009-11-11       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.