| Literature DB >> 31513627 |
Ander Soraluze1, Olatz Arregi2, Xabier Arregi1, Arantza Díaz de Ilarraza1.
Abstract
This paper describes the process of adapting the Stanford Coreference resolution module to the Basque language, taking into account the characteristics of the language. The module has been integrated in a linguistic analysis pipeline obtaining an end-to-end coreference resolution system for the Basque language. The adaptation process explained can benefit and facilitate other languages with similar characteristics in the implementation of their coreference resolution systems. During the experimentation phase, we have demonstrated that language-specific features have a noteworthy effect on coreference resolution, obtaining a gain in CoNLL score of 7.07 with respect to the baseline system. We have also analysed the effect that preprocessing has in coreference resolution, comparing the results obtained with automatic mentions versus gold mentions. When gold mentions are provided, the results increase 11.5 points in CoNLL score in comparison with results obtained when automatic mentions are used. The contribution of each sieve is analysed concluding that morphology is essential for agglutinative languages to obtain good performance in coreference resolution. Finally, an error analysis of the coreference resolution system is presented which have revealed our system's weak points and help to determine the improvements of the system. As a result of the error analysis, we have enriched the Basque coreference resolution adding new two sieves, obtaining an improvement of 0.24 points in CoNLL F1 when automatic mentions are used and of 0.39 points when the gold mentions are provided.Entities:
Mesh:
Year: 2019 PMID: 31513627 PMCID: PMC6742394 DOI: 10.1371/journal.pone.0221801
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1End-to-end coreference resolution for Basque.
Fig 2The architecture of the Basque coreference resolution system.
Examples to illustrate the suitability of the Exact Morphology Match sieve.
| # | Mention | Translation | Lemmas | Number | Definiteness | Coreferent |
|---|---|---|---|---|---|---|
| 1 | txori politak | pretty birds | txori polit | plural | definite | - |
| 2 | txori politekin | with the pretty birds | txori polit | plural | definite | yes |
| 3 | txori politak | pretty bird | txori polit | singular | definite | no |
| 4 | txori politek | pretty birds | txori polit | plural | indefinite | no |
EPEC corpus division information.
| Words | Mentions | Clusters | Singletons | |
|---|---|---|---|---|
| Devel | 30434 | 8432 | 1313 | 4383 |
| Test | 15949 | 4360 | 621 | 2445 |
Performance of baseline and EUSKOR systems with automatic mentions.
| Automatic Mention Detection | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MUC | BLANC | CoNLL | ||||||||||||||
| R | P | R | P | R | P | R | P | R | P | |||||||
| Baseline | 22.48 | 35.27 | 27.46 | 54.81 | 66.17 | 59.96 | 56.13 | 57.6 | 56.86 | 62.08 | 55.50 | 58.61 | 33.47 | 44.96 | 36.75 | 48.67 |
| EUSKOR | 34.10 | 55.76 | 42.32 | 57.98 | 68.83 | 62.94 | 60.78 | 62.31 | 61.54 | 66.02 | 58.41 | 61.98 | 38.41 | 53.57 | 43.18 | 55.74 |
Performance of baseline and EUSKOR systems with gold mentions.
| Gold Mention Detection | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MUC | BLANC | CoNLL | ||||||||||||||
| R | P | R | P | R | P | R | P | R | P | |||||||
| Baseline | 31.6 | 43.32 | 36.55 | 76.32 | 86.92 | 81.28 | 72.13 | 72.13 | 72.13 | 80.44 | 72.11 | 76.05 | 59.47 | 71.06 | 62.94 | 64.62 |
| EUSKOR | 48.76 | 71.94 | 58.12 | 81.35 | 93.47 | 86.99 | 80.57 | 80.57 | 80.57 | 89.00 | 78.24 | 83.27 | 67.09 | 84.65 | 72.77 | 76.12 |
Performance of baseline and EUSKOR systems with gold mention boundaries.
| Gold Mention Boundaries | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MUC | BLANC | CoNLL | ||||||||||||||
| R | P | R | P | R | P | R | P | R | P | |||||||
| Baseline | 31.39 | 43.97 | 36.63 | 76.39 | 87.39 | 81.52 | 72.53 | 72.53 | 72.53 | 81.16 | 72,36 | 76.51 | 59.45 | 71.48 | 62.99 | 64.88 |
| EUSKOR | 48.91 | 70.58 | 57.78 | 81.43 | 93.05 | 86.85 | 80.53 | 80.53 | 80.53 | 88.77 | 78.52 | 83.33 | 67.06 | 83.92 | 72.59 | 75.98 |
Hand-built ordering and Learned ordering.
| Hand-built ordering | Learned ordering |
|---|---|
| S1 Speaker Identification | S1 Speaker Identification |
| S2 Exact Morphology Match | S11 Ellipsis Match |
| S3 Relaxed String Match | S2 Exact Morphology Match |
| S4 Precise Constructs | S3 Relaxed String Match |
| S5 Strict Head Match A | S4 Precise Constructs |
| S6 Strict Head Match B | S8 Proper Head Word Match |
| S7 Strict Head Match C | S6 Strict Head Match B |
| S8 Proper Head Word Match | S5 Strict Head Match A |
| S9 Relaxed Head Match | S7 Strict Head Match C |
| S10 Pronoun Resolution | S10 Pronoun Resolution |
| S11 Ellipsis Match | S9 Relaxed Head Match |
Performance of EUSKOR when sieves are added incrementally.
| MUC | BLANC | CoNLL | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| R | P | R | P | R | P | R | P | R | P | |||||||
| Speaker Identification | 0 | 0 | 0 | 50.62 | 74.78 | 60.37 | 53.24 | 54.58 | 53.9 | 67.47 | 48.54 | 56.46 | 27 | 27.39 | 27.19 | 38.94 |
| Ellipsis Match | 0.15 | 22.22 | 0.3 | 50.65 | 74.7 | 60.37 | 53.29 | 54.63 | 53.95 | 67.47 | 48.64 | 56.53 | 27.03 | 38.5 | 27.25 | 39.06 |
| Exact Morphology Match | 23.56 | 73.96 | 35.74 | 55.44 | 73.29 | 63.13 | 59.32 | 60.82 | 60.06 | 68.29 | 54.43 | 60.58 | 34.65 | 64.76 | 39.95 | 53.15 |
| Relaxed String Match | 23.95 | 72.7 | 36.03 | 55.5 | 73.17 | 63.12 | 59.35 | 60.84 | 60.08 | 68.2 | 54.56 | 60.62 | 34.75 | 63.82 | 40.04 | 53.25 |
| Precise Constructs | 23.95 | 71.19 | 35.84 | 55.5 | 73.02 | 63.06 | 59.28 | 60.77 | 60.01 | 68.11 | 54.61 | 60.62 | 34.75 | 62.56 | 39.96 | 53.17 |
| Proper Head Word Match | 26.43 | 68.61 | 38.16 | 56.06 | 72.52 | 63.24 | 59.9 | 61.41 | 60.65 | 68.15 | 55.57 | 61.22 | 35.68 | 58.95 | 40.88 | 54.20 |
| Strict Head Match B | 28.91 | 65.66 | 40.15 | 56.64 | 71.89 | 63.36 | 60.41 | 61.93 | 61.16 | 67.87 | 56.42 | 61.61 | 36.43 | 58.42 | 41.74 | 55.04 |
| Strict Head Match A | 28.91 | 65.66 | 40.15 | 56.64 | 71.89 | 63.36 | 60.41 | 61.93 | 61.16 | 67.87 | 56.42 | 61.61 | 36.43 | 58.42 | 41.74 | 55.04 |
| Strict Head Match C | 30.07 | 63.39 | 40.79 | 56.92 | 71.34 | 63.32 | 60.39 | 61.91 | 61.14 | 67.31 | 56.63 | 61.51 | 36.87 | 57.84 | 42.18 | 55.20 |
| Pronoun Resolution | 32.24 | 58.42 | 41.55 | 57.52 | 69.68 | 63.02 | 60.53 | 62.05 | 61.28 | 66.56 | 57.6 | 61.76 | 37.77 | 54.66 | 42.73 | 55.44 |
| Relaxed Head Match | 34.1 | 55.76 | 42.32 | 57.98 | 68.84 | 62.95 | 60.76 | 62.28 | 61.51 | 66.00 | 58.4 | 61.97 | 38.41 | 53.35 | 43.14 | 55.74 |
Results obtained when automatic mentions are used.
1=EUSKOR, 2=1+Wiki sieve, 3=1+Synonymy sieve, 4=1+Wiki sieve+Synonymy sieve.
| Automatic mentions | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MUC | BLANC | CoNLL | ||||||||||||||
| R | P | R | P | R | P | R | P | R | P | |||||||
| 1 | 34.1 | 55.76 | 42.32 | 57.98 | 68.83 | 62.94 | 60.78 | 62.31 | 61.54 | 66.02 | 58.41 | 61.98 | 38.41 | 53.57 | 43.18 | 55.74 |
| 2 | 34.41 | 55.70 | 42.54 | 58.09 | 68.64 | 62.93 | 60.73 | 62.26 | 61.49 | 65.94 | 58.49 | 61.99 | 38.65 | 53.27 | 43.35 | 55.82 |
| 3 | 34.57 | 56.03 | 42.76 | 58.08 | 68.80 | 62.98 | 60.85 | 62.38 | 61.61 | 65.99 | 58.51 | 62.03 | 38.53 | 53.65 | 43.31 | 55.92 |
| 4 | 34.88 | 55.90 | 58.19 | 68.60 | 60.80 | 62.33 | 65.92 | 58.60 | 38.77 | 53.33 | ||||||
Results obtained when gold mentions are used.
1=EUSKOR, 2=1+Wiki sieve, 3=1+Synonymy sieve, 4=1+Wiki sieve+Synonymy sieve.
| Gold mentions | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MUC | BLANC | CoNLL | ||||||||||||||
| R | P | R | P | R | P | R | P | R | P | |||||||
| 1 | 48.76 | 71.94 | 58.12 | 81.35 | 93.47 | 86.99 | 80.57 | 80.57 | 80.57 | 89.00 | 78.24 | 83.27 | 67.09 | 84.65 | 72.77 | 76.12 |
| 2 | 49.84 | 70.81 | 58.50 | 81.71 | 92.83 | 86.92 | 80.57 | 80.57 | 80.57 | 88.69 | 78.77 | 83.44 | 67.51 | 83.27 | 72.84 | 76.28 |
| 3 | 50.00 | 71.50 | 58.85 | 81.69 | 93.19 | 87.06 | 80.80 | 80.80 | 80.80 | 88.90 | 78.82 | 83.56 | 67.39 | 84.23 | 72.95 | 76.49 |
| 4 | 50.46 | 70.99 | 81.86 | 92.81 | 86.99 | 80.71 | 80.71 | 88.71 | 79.00 | 67.68 | 83.34 | |||||
Comparison of performance of adapted Stanford and BART systems.
| Automatic Mention Detection | ||||||
|---|---|---|---|---|---|---|
| MUC | BLANC | CoNLL | ||||
| F1 | F1 | F1 | F1 | F1 | F1 | |
| EUSKOR | ||||||
| BART | 39.86 | 61.48 | 59.38 | 59.84 | 42.41 | 53.72 |
| NEURAL | 8.30 | 58.61 | 53.37 | 55.87 | 29.14 | 40.93 |