| Literature DB >> 30517200 |
Daisaku Shibata1, Kaoru Ito1, Hiroyuki Nagai2, Taro Okahisa2, Ayae Kinoshita3, Eiji Aramaki1.
Abstract
BACKGROUND: Idea density (ID), a natural language processing-based index, was developed to aid in the detection of dementia through the analysis of English narratives. However, it has not been applied to non-English languages due to the difficulties in translating grammatical concepts. In this study, we defined rules to count ideas in Japanese narratives based on a previous study and proposed a novel method to estimate ID in Japanese text using machine translation. MATERIALS: The study participants comprised 42 Japanese patients with dementia aged 69-98 years (mean: 84.95 years). We collected free narratives from the participants to build a speech corpus. The narratives of the patients were translated into English using three machine translation systems: Google Translate, Bing Translator, and Excite Translator. The ID in the translated text was then calculated using the Dependency-based Propositional ID (DEPID), an English ID scoring tool.Entities:
Mesh:
Year: 2018 PMID: 30517200 PMCID: PMC6281229 DOI: 10.1371/journal.pone.0208418
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Machine translations types.
| Service | Method | Powered by |
|---|---|---|
| Google Translate | Neural Machine Translation (NMT) | |
| Bing Translator | Statistical Machine Translation (SMT) | Microsoft |
| Excite Translator | Phrase-Based Machine Translation (PBMT) | Excite Japan |
Spearman’s correlation and slope between the number of ideas in Japanese and English.
| Method | Spearman’s | Slope |
|---|---|---|
| 0.983 | 0.838 | |
| Bing | 0.975 | 0.913 |
| Excite | 0.978 | 0.714 |
Fig 1Correlation diagram between the number of morphemes and the number of tokens.
Spearman’s correlation and slope between the number of morphemes and number of tokens.
| Method | Spearman’s | Slope |
|---|---|---|
| 0.995 | 1.97 | |
| Bing | 0.995 | 1.85 |
| Excite | 0.995 | 2.46 |
Fig 2Procedure to calculate ID using various methods.
Spearman’s correlation between D-ID and the MMSE score.
| Method | Spearman’s |
|---|---|
| 0.105 | |
| Bing | 0.120 |
| Excite | 0.294 |
Spearman’s correlation between DR-ID, DRA-ID, and the MMSE score.
| Version | Method | Spearman’s |
|---|---|---|
| DEPID-R | 0.160 | |
| Bing | 0.060 | |
| Excite | 0.386 | |
| DEPID-R-ADD | 0.473 | |
| Bing | 0.334 | |
| Excite | 0.365 |
Correlation coefficients between the MMSE score and linguistic measures.
Tests for non-correlation were used to calculate p-values. (*, p<0.05).
| Measures | Correlation coefficient between MMSE score and linguistic measures | |
|---|---|---|
| 0.473 | 0.002* | |
| 0.153 | 0.330 | |
| 0.021 | 0.900 | |
| 0.231 | 0.140 | |
| 0.062 | 0.700 | |
| 0.126 | 0.430 | |
| 0.253 | 0.110 | |
| 0.231 | 0.140 | |
| 0.062 | 0.700 |
Examples of translations provided by each machine translation system.
| Original Japanese | Translated by a human | Bing | Excite | ||
|---|---|---|---|---|---|
| (a) | Well, can you please wait a second? | Well, please wait a minute. | Oh, wait a minute. | Please wait a moment. | |
| (b) | I can eat nearly anything. | I do eat most of the food. | I usually eat food. | Everything almost eats food. | |
| (c) | I dislike that crucian carp sushi. | I mean, I hate it. | Carp, I hate sushi. | Uh, I don’t like crucian carp ZU SHI. | |