Literature DB >> 29399705

Coreferential Relations in Basque: The Annotation Process.

Klara Ceberio1, Itziar Aduriz2, Arantza Díaz de Ilarraza3, Ines Garcia-Azkoaga4.   

Abstract

In this paper we present the coreferential tagging of part of the EPEC Corpus of Basque. Although coreference is a pragmatic linguistic phenomenon highly dependent on the situational context, it shows some language-specific patterns that vary according to the features of each language. Due to the fact that Basque is not an Indo-European language, it differs considerably in grammar from the languages spoken in surrounding areas. We will explain these features and the decisions made in each case. After describing the criteria defined for coreferential tagging in Basque, the annotation process will be explained. Our annotation is based on a morphologically and syntactically annotated corpus that provides us with a manageable environment, in which the specific structures that are part of a reference chain can be more easily identified. A part of the corpus was tagged by two annotators who marked up the same text independently, and by another annotator that acted as judge, solving problems in case of disagreement. All this process has been automatized as a result of previous studies carried out in this field. The automatic detection of mentions (Soraluze et al., in: Proceedings of Konvens, 2012) has provided us with a better working environment, and given us the possibility to build a first significant corpus for a later computational treatment of automatic coreferential resolution.

Keywords:  Coreference; Coreferential relations; Coreferential tagging

Mesh:

Year:  2018        PMID: 29399705     DOI: 10.1007/s10936-018-9559-6

Source DB:  PubMed          Journal:  J Psycholinguist Res        ISSN: 0090-6905


  2 in total

1.  An extensive review of tools for manual annotation of documents.

Authors:  Mariana Neves; Jurica Ševa
Journal:  Brief Bioinform       Date:  2021-01-18       Impact factor: 11.622

2.  EUSKOR: End-to-end coreference resolution system for Basque.

Authors:  Ander Soraluze; Olatz Arregi; Xabier Arregi; Arantza Díaz de Ilarraza
Journal:  PLoS One       Date:  2019-09-12       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.