Literature DB >> 30895456

SUBTLEX-CAT: Subtitle word frequencies and contextual diversity for Catalan.

Roger Boada1, Marc Guasch2, Juan Haro2, Josep Demestre2, Pilar Ferré2.   

Abstract

SUBTLEX-CAT is a word frequency and contextual diversity database for Catalan, obtained from a 278-million-word corpus based on subtitles supplied from broadcast Catalan television. Like all previous SUBTLEX corpora, it comprises subtitles from films and TV series. In addition, it includes a wider range of TV shows (e.g., news, documentaries, debates, and talk shows) than has been included in most previous databases. Frequency metrics were obtained for the whole corpus, on the one hand, and only for films and fiction TV series, on the other. Two lexical decision experiments revealed that the subtitle-based metrics outperformed the previously available frequency estimates, computed from either written texts or texts from the Internet. Furthermore, the metrics obtained from the whole corpus were better predictors than the ones obtained from films and fiction TV series alone. In both experiments, the best predictor of response times and accuracy was contextual diversity.

Entities:  

Keywords:  Catalan language; Contextual diversity; Subtitles; Word frequency

Mesh:

Year:  2020        PMID: 30895456     DOI: 10.3758/s13428-019-01233-1

Source DB:  PubMed          Journal:  Behav Res Methods        ISSN: 1554-351X


  21 in total

1.  Recognition memory for 2,578 monosyllabic words.

Authors:  Michael J Cortese; Maya M Khanna; Sarah Hacker
Journal:  Memory       Date:  2010-07-30

2.  Contextual diversity, not word frequency, determines word-naming and lexical decision times.

Authors:  James S Adelman; Gordon D A Brown; José F Quesada
Journal:  Psychol Sci       Date:  2006-09

3.  Moving beyond Kucera and Francis: a critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English.

Authors:  Marc Brysbaert; Boris New
Journal:  Behav Res Methods       Date:  2009-11

Review 4.  The word frequency effect: a review of recent developments and implications for the choice of frequency estimates in German.

Authors:  Marc Brysbaert; Matthias Buchmeier; Markus Conrad; Arthur M Jacobs; Jens Bölte; Andrea Böhl
Journal:  Exp Psychol       Date:  2011

5.  Facilitative effect of cognate words vanishes when reducing the orthographic overlap: The role of stimuli list composition.

Authors:  Montserrat Comesaña; Pilar Ferré; Joaquín Romero; Marc Guasch; Ana P Soares; Teófilo García-Chico
Journal:  J Exp Psychol Learn Mem Cogn       Date:  2014-10-20       Impact factor: 3.051

6.  EsPal: one-stop shopping for Spanish word properties.

Authors:  Andrew Duchon; Manuel Perea; Nuria Sebastián-Gallés; Antonia Martí; Manuel Carreiras
Journal:  Behav Res Methods       Date:  2013-12

7.  Losing control of your languages: a case study.

Authors:  Marco Calabria; Paula Marne; Lucía Romero-Pinel; Montserrat Juncadella; Albert Costa
Journal:  Cogn Neuropsychol       Date:  2014-02-05       Impact factor: 2.468

8.  On the overlap between bilingual language control and domain-general executive control.

Authors:  Francesca M Branzi; Marco Calabria; Maria Lucrezia Boscarino; Albert Costa
Journal:  Acta Psychol (Amst)       Date:  2016-04-01

9.  "Aberrant" MHC class II expression in epithelia.

Authors:  M Moore; A K Ghosh
Journal:  Lancet       Date:  1987-01-17       Impact factor: 79.321

10.  Subtitle-based word frequencies as the best estimate of reading behavior: the case of greek.

Authors:  Maria Dimitropoulou; Jon Andoni Duñabeitia; Alberto Avilés; José Corral; Manuel Carreiras
Journal:  Front Psychol       Date:  2010-12-21
View more
  1 in total

1.  Prevalence norms for 40,777 Catalan words: An online megastudy of vocabulary size.

Authors:  Marc Guasch; Roger Boada; Jon Andoni Duñabeitia; Pilar Ferré
Journal:  Behav Res Methods       Date:  2022-09-09
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.