| Literature DB >> 27746716 |
Sebastian Halder1, Kouji Takano2, Hiroki Ora3, Akinari Onishi2, Kota Utsumi4, Kenji Kansaku3.
Abstract
Gaze-independent brain-computer interfaces (BCIs) are a possible communication channel for persons with paralysis. We investigated if it is possible to use auditory stimuli to create a BCI for the Japanese Hiragana syllabary, which has 46 Hiragana characters. Additionally, we investigated if training has an effect on accuracy despite the high amount of different stimuli involved. Able-bodied participants (N = 6) were asked to select 25 syllables (out of fifty possible choices) using a two step procedure: First the consonant (ten choices) and then the vowel (five choices). This was repeated on 3 separate days. Additionally, a person with spinal cord injury (SCI) participated in the experiment. Four out of six healthy participants reached Hiragana syllable accuracies above 70% and the information transfer rate increased from 1.7 bits/min in the first session to 3.2 bits/min in the third session. The accuracy of the participant with SCI increased from 12% (0.2 bits/min) to 56% (2 bits/min) in session three. Reliable selections from a 10 × 5 matrix using auditory stimuli were possible and performance is increased by training. We were able to show that auditory P300 BCIs can be used for communication with up to fifty symbols. This enables the use of the technology of auditory P300 BCIs with a variety of applications.Entities:
Keywords: P300; assistive technology; auditory stimulation; brain-computer interface; electroencephalography; event-related potentials; gaze independence
Year: 2016 PMID: 27746716 PMCID: PMC5043244 DOI: 10.3389/fnins.2016.00446
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 4.677
Figure 1The Hiragana speller used in this study. The symbols are arranged in a 5 × 10 matrix (A). In this design the user first heard the ten syllables ending with “a” from ten different directions (B). These were used to select the consonant. After presentation of the stimuli used to select the consonant five syllables for selecting the vowel were presented (C). Note that the actual stimulus duration varied due to the length of the sound files used for stimulation (see analysis of sound files in results section). The stimulus-onset-asynchrony of 375 ms was constant.
Figure 2Offline selection accuracy of the healthy participants. Accuracies are shown separately for vowels, consonants, and Hiragana syllables (the latter being correct if both the correct vowel and consonant were selected). The accuracies do not take the decrease of selection times into account that occurred due to the reduction of stimulus repetitions. The second row shows the information transfer rates (ITRs) for vowel, consonant, and Hiragana selection (which conveys the most information per selection) of the healthy participants. The ITRs take the decrease of selection times into account that occurred due to the reduction of stimulus repetitions. The third row the time per selection needed by the healthy participants after the number of stimulus repetitions had been adjusted based on the performance in the calibration session.
Figure 3Selection accuracies of the end-user participating in the study. The end-user improved with each session.
Figure 4ERP responses of the healthy participants in sessions one, two, and three (top left). The ERPs end-user one, who used the same system as the healthy controls, are shown top right. The continuous lines show target responses, the dashed lines non-target responses.
Figure 5The wave forms of the sound stimuli used in the current study. We took the absolute values of the signals and smoothed them with a 1000 sample moving average. These curves are shown in blue for the right and left channel for each stimulus used. Maximum peaks are marked with a red “X.” Periods with less than 1% of the maximum amplitude are marked with a green bar. The column stimuli are shown in the top two rows and the row stimuli in the bottom two rows.
Four selected studies using auditory P300 for spelling.
| Baykara et al., | 25 | 13 of 16 | 77 | Latin alphabet excluding Z |
| Chang et al., | 25 | 0 of 5 | 40 | Hiragana subset |
| Kleih et al., | 27 | 8 of 11 | 84 | Latin alphabet and backspace |
| Current study | 50 | 4 of 6 | 51 | Hiragana syllabary |
Two studies use the Hiragana syllabary and two the Latin alphabet. The list is not exhaustive and serves to put the current study into context with approaches that enable a lower number of selections. The table shows the number of different possible selections, the number of participants (all were healthy volunteers) which reached 70% accuracy, the average accuracy, and which symbol set was used for spelling. The average accuracy given for the current study is averaged across all three sessions (43, 53, and 57% in the first, second, and third session respectively).