Literature DB >> 25419199

The Hebrew CHILDES corpus: transcription and morphological analysis.

Aviad Albert1, Brian MacWhinney2, Bracha Nir3, Shuly Wintner4.   

Abstract

We present a corpus of transcribed spoken Hebrew that reflects spoken interactions between children and adults. The corpus is an integral part of the CHILDES database, which distributes similar corpora for over 25 languages. We introduce a dedicated transcription scheme for the spoken Hebrew data that is sensitive to both the phonology and the standard orthography of the language. We also introduce a morphological analyzer that was specifically developed for this corpus. The analyzer adequately covers the entire corpus, producing detailed correct analyses for all tokens. Evaluation on a new corpus reveals high coverage as well. Finally, we describe a morphological disambiguation module that selects the correct analysis of each token in context. The result is a high-quality morphologically-annotated CHILDES corpus of Hebrew, along with a set of tools that can be applied to new corpora.

Entities:  

Keywords:  CHILDES; Hebrew; Morphological analysis; Morphological disambiguation; Transcription of spoken language

Year:  2013        PMID: 25419199      PMCID: PMC4240028          DOI: 10.1007/s10579-012-9214-z

Source DB:  PubMed          Journal:  Lang Resour Eval        ISSN: 1574-020X            Impact factor:   1.358


  8 in total

1.  An empirical generative framework for computational modeling of language acquisition.

Authors:  Heidi R Waterfall; Ben Sandbank; Luca Onnis; Shimon Edelman
Journal:  J Child Lang       Date:  2010-06

2.  Explaining quantitative variation in the rate of Optional Infinitive errors across languages: a comparison of MOSAIC and the Variational Learning Model.

Authors:  Daniel Freudenthal; Julian Pine; Fernand Gobet
Journal:  J Child Lang       Date:  2010-03-25

3.  Morphosyntactic annotation of CHILDES transcripts.

Authors:  Kenji Sagae; Eric Davis; Alon Lavie; Brian Macwhinney; Shuly Wintner
Journal:  J Child Lang       Date:  2010-03-25

4.  Types of linguistic knowledge: interpreting and producing compound nouns.

Authors:  E V Clark; R A Berman
Journal:  J Child Lang       Date:  1987-10

5.  Language development and language knowledge: evidence from the acquisition of Hebrew morphophonology.

Authors:  R A Berman
Journal:  J Child Lang       Date:  1981-10

6.  Automatic parsing of parental verbal input.

Authors:  Kenji Sagae; Brian MacWhinney; Alon Lavie
Journal:  Behav Res Methods Instrum Comput       Date:  2004-02

7.  Children's grammars grow more abstract with age--evidence from an automatic procedure for identifying the productive units of language.

Authors:  Gideon Borensztajn; Willem Zuidema; Rens Bod
Journal:  Top Cogn Sci       Date:  2009-01

8.  Modeling children's early grammatical knowledge.

Authors:  Colin Bannard; Elena Lieven; Michael Tomasello
Journal:  Proc Natl Acad Sci U S A       Date:  2009-10-05       Impact factor: 11.205

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.