Morphosyntactic annotation of CHILDES transcripts.

Kenji Sagae, Eric Davis, Alon Lavie, Brian MacWhinney, Shuly Wintner.

Abstract

Corpora of child language are essential for research in child language acquisition and psycholinguistics. Linguistic annotation of the corpora provides researchers with better means for exploring the development of grammatical constructions and their usage. We describe a project whose goal is to annotate the English section of the CHILDES database with grammatical relations in the form of labeled dependency structures. We have produced a corpus of over 18,800 utterances (approximately 65,000 words) with manually curated gold-standard grammatical relation annotations. Using this corpus, we have developed a highly accurate data-driven parser for the English CHILDES data, which we used to automatically annotate the remainder of the English section of CHILDES. We have also extended the parser to Spanish, and are currently working on supporting more languages. The parser and the manually and automatically annotated data are freely available for research purposes.

Year:  2010        PMID: 20334720      PMCID: PMC4048841          DOI: 10.1017/S0305000909990407

Source DB:  PubMed          Journal:  J Child Lang        ISSN: 0305-0009


  3 in total

1.  Automatic disambiguation of morphosyntax in spoken language corpora.

Authors:  C Parisse; M T Le Normand
Journal:  Behav Res Methods Instrum Comput       Date:  2000-08

2.  From exemplar to grammar: a probabilistic analogy-based model of language learning.

Authors:  Rens Bod
Journal:  Cogn Sci       Date:  2009-04-08

3.  Children's grammars grow more abstract with age--evidence from an automatic procedure for identifying the productive units of language.

Authors:  Gideon Borensztajn; Willem Zuidema; Rens Bod
Journal:  Top Cogn Sci       Date:  2009-01
  9 in total

1.  Understanding spoken language through TalkBank.

Authors:  Brian MacWhinney
Journal:  Behav Res Methods       Date:  2019-08

2.  Using Computerized Language Analysis to Evaluate Grammatical Skills.

Authors:  Lizbeth H Finestack; Bobbi Rohwer; Lisa Hilliard; Leonard Abbeduto
Journal:  Lang Speech Hear Serv Sch       Date:  2020-04-07       Impact factor: 2.983

3.  The Hebrew CHILDES corpus: transcription and morphological analysis.

Authors:  Aviad Albert; Brian MacWhinney; Bracha Nir; Shuly Wintner
Journal:  Lang Resour Eval       Date:  2013-12-01       Impact factor: 1.358

4.  How children explore the phonological network in child-directed speech: A survival analysis of children's first word productions.

Authors:  Matthew T Carlson; Morgan Sonderegger; Max Bane
Journal:  J Mem Lang       Date:  2014-08       Impact factor: 3.059

5.  AphasiaBank: Methods for Studying Discourse.

Authors:  Brian MacWhinney; Davida Fromm; Margaret Forbes; Audrey Holland
Journal:  Aphasiology       Date:  2011-09-22       Impact factor: 2.773

6.  Using Free Computer-Assisted Language Sample Analysis to Evaluate and Set Treatment Goals for Children Who Speak African American English.

Authors:  Courtney Overton; Taylor Baron; Barbara Zurer Pearson; Nan Bernstein Ratner
Journal:  Lang Speech Hear Serv Sch       Date:  2021-01-18       Impact factor: 2.983

7.  Netlang: A software for the linguistic analysis of corpora by means of complex networks.

Authors:  Lluís Barceló-Coblijn; Diego Serna Salazar; Gustavo Isaza; Luis F Castillo Ossa; Manuel G Bedia
Journal:  PLoS One       Date:  2017-08-23       Impact factor: 3.240

8.  Tracking Child Language Development With Neural Network Language Models.

Authors:  Kenji Sagae
Journal:  Front Psychol       Date:  2021-07-08

9.  Computational evaluation of the Traceback Method.

Authors:  Sheli Kol; Bracha Nir; Shuly Wintner
Journal:  J Child Lang       Date:  2013-01-24