| Literature DB >> 27006651 |
Julius Gelšvartas1, Rimvydas Simutis1, Rytis Maskeliūnas2.
Abstract
This paper describes in detail the design of the specialized text predictor for patients with Huntington's disease. The main aim of the specialized text predictor is to improve the text input rate by limiting the phrases that the user can type in. We show that such specialized predictor can significantly improve text input rate compared to a standard general purpose text predictor. Specialized text predictor, however, makes it more difficult for the user to express his own ideas. We further improved the text predictor by using the sematic database to extract synonym, hypernym, and hyponym terms for the words that are not present in the training data of the specialized text predictor. This data can then be used to compute reasonable predictions for words that are originally not known to the text predictor.Entities:
Mesh:
Year: 2016 PMID: 27006651 PMCID: PMC4781939 DOI: 10.1155/2016/3054258
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1The statistics of the training data corpora: sentence length histogram and word length histogram.
Figure 2Semantic text predictor algorithm diagram.
Figure 3The TPOC of standard, adapted, and specialized text predictors. Each predictor was configured to return 6 to 2 predictions.
Model type impact results.
| Number of predictions | Model type | |||
|---|---|---|---|---|
| Standard | Adapted | Specialized | ||
| 2 | TPOC, % | 81.05 | 26.9 | 15.31 |
| STD, % | 25.61 | 24.93 | 16.98 | |
| CI, % | 1.54 | 1.5 | 1.02 | |
|
| ||||
| 3 | TPOC, % | 75.97 | 24.33 | 13.45 |
| STD, % | 27.05 | 23.41 | 15.39 | |
| CI, % | 1.63 | 1.41 | 0.93 | |
|
| ||||
| 4 | TPOC, % | 74.81 | 22.93 | 12.29 |
| STD, % | 28.05 | 22.72 | 14.41 | |
| CI, % | 1.69 | 1.37 | 0.87 | |
|
| ||||
| 5 | TPOC, % | 72 | 21.77 | 11.2 |
| STD, % | 29.66 | 22.08 | 13.61 | |
| CI, % | 1.79 | 1.33 | 0.82 | |
|
| ||||
| 6 | TPOC, % | 70.89 | 20.85 | 10.72 |
| STD, % | 30.31 | 21.54 | 13.25 | |
| CI, % | 1.83 | 1.3 | 0.8 | |
Figure 4The TPOC of five language models with different cardinalities.
Model cardinality impact results.
| Model cardinality | Number of predictions | |||||
|---|---|---|---|---|---|---|
| 2 | 3 | 4 | 5 | 6 | ||
| 1 | TPOC, % | 26.09 | 23.03 | 21.21 | 19.76 | 18.76 |
| STD, % | 16.41 | 14.18 | 13.52 | 13.02 | 12.59 | |
| CI, % | 0.99 | 0.86 | 0.82 | 0.79 | 0.76 | |
|
| ||||||
| 2 | TPOC, % | 16.31 | 14.28 | 13.01 | 11.8 | 11.23 |
| STD, % | 16.7 | 15.23 | 14.31 | 13.55 | 13.22 | |
| CI, % | 1 | 0.92 | 0.86 | 0.82 | 0.8 | |
|
| ||||||
| 3 | TPOC, % | 15.31 | 13.45 | 12.29 | 11.2 | 10.72 |
| STD, % | 16.98 | 15.39 | 14.41 | 13.61 | 13.25 | |
| CI, % | 1.02 | 0.93 | 0.87 | 0.82 | 0.8 | |
|
| ||||||
| 4 | TPOC, % | 15.21 | 13.39 | 12.24 | 11.15 | 10.7 |
| STD, % | 17 | 15.39 | 14.41 | 13.61 | 13.25 | |
| CI, % | 1.03 | 0.93 | 0.87 | 0.82 | 0.8 | |
|
| ||||||
| 5 | TPOC, % | 15.21 | 13.39 | 12.24 | 11.15 | 10.7 |
| STD, % | 17 | 15.39 | 14.41 | 13.61 | 13.25 | |
| CI, % | 1.03 | 0.93 | 0.87 | 0.82 | 0.8 | |
Figure 5Word length impact on text predictor performance.