Literature DB >> 30518304

Distraction by deviant sounds during reading: An eye-movement study.

Martin R Vasilev¹, Fabrice Br Parmentier^2,3,4, Bernhard Angele¹, Julie A Kirkby¹.

Abstract

Oddball studies have shown that sounds unexpectedly deviating from an otherwise repeated sequence capture attention away from the task at hand. While such distraction is typically regarded as potentially important in everyday life, previous work has so far not examined how deviant sounds affect performance on more complex daily tasks. In this study, we developed a new method to examine whether deviant sounds can disrupt reading performance by recording participants' eye movements. Participants read single sentences in silence and while listening to task-irrelevant sounds. In the latter condition, a 50-ms sound was played contingent on the fixation of five target words in the sentence. On most occasions, the same tone was presented (standard sound), whereas on rare and unexpected occasions it was replaced by white noise (deviant sound). The deviant sound resulted in significantly longer fixation durations on the target words relative to the standard sound. A time-course analysis showed that the deviant sound began to affect fixation durations around 180 ms after fixation onset. Furthermore, deviance distraction was not modulated by the lexical frequency of target words. In summary, fixation durations on the target words were longer immediately after the presentation of the deviant sound, but there was no evidence that it interfered with the lexical processing of these words. The present results are in line with the recent proposition that deviant sounds yield a temporary motor suppression and suggest that deviant sounds likely inhibit the programming of the next saccade.

Entities: Chemical Disease Gene Species

Keywords: Deviance distraction; auditory distractors; eye-movements; reading

Mesh：

Year: 2019 PMID： 30518304 PMCID： PMC6613176 DOI： 10.1177/1747021818820816

Source DB: PubMed Journal: Q J Exp Psychol (Hove) ISSN： 1747-0218 Impact factor: 2.143

Oddball studies have shown that task-irrelevant sounds that unexpectedly differ from an otherwise structured or repeated sequence of sounds yield specific electrophysiological responses and behavioural distraction in an unrelated task (Berti, 2008; Berti & Schröger, 2001, 2003; Horváth, Roeber, Bendixen, & Schröger, 2008; Schröger, 1996). In the oddball paradigm, an irrelevant sound is presented before the appearance of a target stimulus on the screen. On most trials, the same sound is presented (standard) whereas on rare and unpredictable occasions it is replaced by a different sound (deviant). The typical finding from such studies is that deviant sounds delay responses in categorisation tasks where participants must respond to target stimuli while ignoring task-irrelevant sounds (Ljungberg & Parmentier, 2012; Parmentier, 2014; Parmentier, Elford, Escera, Andrés, & Miguel, 2008; Parmentier, Vasilev & Andrés, 2018; Parmentier, 2008). Previous research has shown that deviant sounds are distracting because they violate the cognitive system’s predictions (Bubic, von Cramon, Jacobsen, Schröger, & Schubotz, 2009; Parmentier, Elsley, Andrés, & Barceló, 2011). In fact, attentional distraction has been observed at the electrophysiological and behavioural level for both small pitch differences and larger spectral differences between the standard and the deviant sound (Parmentier et al., 2008; Schröger, 1996). In addition, there is abundant evidence showing that deviance distraction does not depend on the specific identity of the sounds: it occurs regardless of whether sound A (e.g., a sinewave tone) is used as the standard and sound B (e.g., white noise) is used as the deviant, or vice versa (Leiva, Parmentier, & Andrés, 2015b). This latter finding has also been shown to generalise to the tactile modality (Parmentier, Ljungberg, Elsley, & Lindkvist, 2011). Interestingly, deviant sounds that convey meaning can also yield distraction because they undergo some automatic semantic evaluation (Parmentier & Kefauver, 2015; Parmentier, Pacheco-Unguetti, & Valero, 2018; Parmentier, Turner, & Elsley, 2011; Parmentier, Turner, & Perez, 2014; Roye, Jacobsen, & Schröger, 2007; Schröger, Giard, & Wolff, 2000). For example, participants show neural responses to the semantic content of unexpected sounds even when they are passively exposed to them (Czigler, Cox, Gyimesi, & Horváth, 2007; Frangos, Ritter, & Friedman, 2005; Friedman, Cycowicz, & Dziobek, 2003; Roye et al., 2007; Shtyrov, Hauk, & Pulvermuller, 2004; Shtyrov & Pulvermuller, 2003). In addition, the semantics of deviant sounds can be processed even when the words’ meaning bears no connection to the primary task (Escera, Yago, Corral, Corbera, & Nuñez, 2003). Finally, deviance distraction can also be modulated by other factors, such as participants’ age. For example, deviant sounds cause greater behavioural distraction in old age under certain conditions (Leiva, Andrés, & Parmentier, 2015; Leiva, Parmentier, & Andrés, 2015a) although this does not appear to reflect age-related differences in the electrophysiological orienting response (Berti, Vossel, & Gamer, 2017). Deviance distraction is traditionally viewed as an involuntary switch of attention away from the main task caused by the detection of subtle auditory changes in the human brain (Escera, Alho, Winkler, & Näätänen, 1998; Schröger, 1996). This is essentially an orienting response (see Sokolov, 1963) characterised by a burst of arousal and a reflexive orienting of attention towards the eliciting stimulus (Näätänen, 1992). Deviant sounds are associated with a specific neurophysiogical signature reflected by three distinct event-related potential (ERP) components: (1) mismatch negativity (MMN; Näätänen, Gaillard, & Mäntysalo, 1978; Näätänen, Paavilainen, Rinne, & Alho, 2007), an early ERP component that reflects a pre-attentive mechanism for detecting auditory changes in the brain (Berti & Schröger, 2001); (2) a P3a component reflecting the involuntary shift of attention towards the deviant sound (Berti & Schröger, 2001; Escera, Alho, Schröger, & Winkler, 2000; Schröger & Wolff, 1998a, 1998b); and (3) a reorientation negativity (RON) component reflecting the refocusing of attention back to the main task (Berti, 2008; Schröger et al., 2000; Schröger & Wolff, 1998a). Interestingly, while deviance distraction has typically been regarded as an example of attentional distraction, recent work suggests that deviant sounds may also temporarily inhibit motor cortical areas (Wessel, 2017; Wessel & Aron, 2013). For example, Wessel and Aron (2013) observed reduced corticospinal excitability following the presentation of novel sounds, leading them to suggest that they activate the same neural circuits that are used to interrupt ongoing actions. The reduction in corticospinal excitability by unexpected sounds has also been found to be positively correlated with action-stopping behaviour in a Go/NoGo task (Dutra, Waller, & Wessel, 2018). This extends the traditional explanation of deviance distraction as an orienting response by suggesting that unexpected sounds may also induce global motor inhibition through the same neural circuits that are used to stop ongoing action plans. Such global inhibition may serve the purpose of temporarily suspending ongoing processes to facilitate the effective and timely processing of unexpected stimuli (Wessel, 2017; Wessel & Aron, 2017). Consequently, the orienting response and motor inhibition accounts are not necessarily mutually exclusive and could both be a consequence of encountering unexpected sounds in the environment. The potential role of motor inhibition in deviance distraction is exciting as it suggests that deviant sounds may potentially affect a large range of activities, including those relying on relatively automatic motor processes. However, despite this potential impact on everyday life situations, previous studies have used simple laboratory tasks that may have limited ecological validity. We address this issue by exploring for the first time whether deviant sounds affect performance on one important and complex everyday task: reading. To do so, we developed a new method to measure the effect of deviant sounds on eye movements during reading. Reading is a theoretically interesting task for studying the effect of deviant sounds on human performance because it does not require any specific response from participants upon hearing the task-irrelevant sounds. Unlike categorisation tasks where participants need to make a dichotomous response after the presentation of the sound (e.g., judging whether a number is odd or even), subjects simply have to read the text for comprehension and ignore the task-irrelevant sounds. This makes it possible to investigate how deviant sounds affect performance on a natural, everyday task that does not involve any response preparation or the need to act upon a specific stimulus after the sound is presented. In addition, skilled adult reading is a fairly automatised process (LaBerge & Samuels, 1974), which involves the intricate coordination of oculomotor and cognitive processes that determines when and where to move the eyes next. As a result, it can yield valuable insights into how deviant sounds influence cognitive and oculomotor processes. While there is a very long history of research into auditory distraction, most studies have only considered the influence of continuous auditory distractors (e.g., irrelevant speech or music) on behavioural measures such as comprehension accuracy (Vasilev, Kirkby, & Angele, 2018). However, recording participants’ eye movements makes it possible to investigate how irrelevant sounds affect the moment-to-moment decision of when and where to move the eyes next. Because eye movements during reading are sensitive to the underlying cognitive processing of the text (Rayner, 1998, 2009), this method has the potential to uncover subtle auditory-distraction effects that may not appear in behavioural measures of comprehension. It is not known whether eye movements during reading are sensitive to discrete deviant sounds. However, recent work does indicate that they are affected by certain types of continuous sounds. For example, background music and unintelligible speech in a foreign language do not appear to affect fixation durations or fixation probabilities during reading (Cauchard, Cane, & Weger, 2012; Hyönä & Ekholm, 2016; Johansson, Holmqvist, Mossberg, & Lindgren, 2012). However, semantically meaningful irrelevant speech disrupts the reading process by prompting participants to make more re-reading fixations on previously read words (Cauchard et al., 2012; Hyönä & Ekholm, 2016; Yan, Meng, Liu, He, & Paterson, 2018). This latter finding is interesting because it suggests that certain task-irrelevant sounds can have a direct influence on eye movements and interfere with the ongoing cognitive processing of the text. While discrete deviant sounds could also potentially influence fixation durations during reading, this is not expected to occur through the same mechanism that is responsible for distraction by meaningful speech. The available evidence suggests that meaningful speech causes distraction because readers process its semantic features, which in turn interferes with extracting the meaning of the written text (e.g., Hyönä & Ekholm, 2016; Martin, Wogalter, & Forlano, 1988; see also Marsh, Hughes, & Jones, 2008, 2009). However, we hypothesise that deviant sounds cause distraction not because of semantic interference, but rather because they violate readers’ expectation that another standard sound will be presented (Bubic et al., 2009; Parmentier, Elsley, et al., 2011). Therefore, this study builds upon previous work on distraction by continuous sounds by exploring a different mechanism through which task-irrelevant sounds may influence eye movements during reading.

Present study

We developed a new manipulation in which the presentation of task-irrelevant sounds was contingent on participants’ eye movements. While participants read sentences for comprehension, short sounds were presented upon fixating five target words in each sentence. On most occasions, the sound was a sine-wave tone (i.e., the standard sound) whereas on rare and unpredictable occasions it was replaced by a short burst of white noise (i.e., the deviant sound). Few studies to date have used gaze-contingent auditory presentation in a reading task (Eiter & Inhoff, 2010; Inhoff, Connine, Eiter, Radach, & Heller, 2004; Inhoff, Connine, & Radach, 2002). In these studies, participants heard an irrelevant spoken word once their eyes crossed an invisible boundary (Rayner, 1975) located just before a target word in the sentence. In this study, an invisible boundary was inserted before each of the five target words in the sentence. If deviant sounds in this task elicit an orienting response (e.g., Escera et al., 1998; Parmentier, 2014), this may happen either overtly or covertly. According to Posner (1980), overt orienting occurs when there is an eye movement directed towards the eliciting stimulus. Conversely, covert orienting occurs when there is a shift of attention without any corresponding eye movements. If deviant sounds trigger an overt shift of attention, this would manifest in shorter fixation durations after the presentation of the sound because the current fixation will be interrupted and the orienting response (i.e., the eye movement) will occur. However, because the deviant stimulus is in the auditory domain and because it is not linked to a specific location on the screen, it is more likely that the orienting response would occur covertly (i.e., without an eye movement). In this case, attention would be redirected away from processing the words in the sentence and towards the deviant sound, resulting in an increase in fixation durations due to a disruption in the lexical processing of the target words. Alternatively, if the orienting response occurs later in time, it may disrupt the allocation of attention to other words in the sentence, or the post-lexical stages of sentence integration. Similarly, if deviant sounds elicit global motor inhibition (Wessel & Aron, 2013), this would also lead to longer fixation durations after the presentation of the sound, likely due to a delay in saccade programming. On the basis of previous work suggesting that deviant sounds delay the processing of target stimuli (Parmentier et al., 2008) and the notion that this might reflect the temporary suppression of ongoing actions (Wessel & Aron, 2013), we hypothesised that the deviant sounds would lead to increased fixation durations on the target words. In addition, to determine if any potential increase in fixation durations is due to a disruption in the initial stages of lexical processing, we examined the time course of the effect and tested whether it is modulated by the corpus lexical frequency of the target words.

Method

Participants

Forty-eight Bournemouth University students participated for course credit[1] (45 female). Their average age was 19.7 years (SD = 2.4 years, range: 18-32 years). All were native speakers of British English, reported normal or corrected-to-normal vision, normal hearing, no prior diagnosis of reading disorders, and were naïve as to the purpose of the experiment. The study was approved by the Bournemouth University Research Ethics Committee (protocol No. 16999). Informed written consent was obtained from all participants who took part in the study.

Materials and design

The standard sound was a 400-Hz sine wave and the deviant sound was a burst of white noise. The sounds were generated in Matlab R2014a (MathWorks, 2014). Both sounds were 50 ms long and had a 10-ms fade-in and fade-out. The amplitude resolution of the sounds was 16 bits and the sampling frequency was 48 kHz. The reading materials consisted of 120 English sentences (see the Supplementary Material). Their average length was 14.3 words (range: 13-18 words). As illustrated in Figure 1a, each sentence contained five target words. The sound stimuli were played once the target words were fixated (we use the term “target” to denote the words on which sounds were played, as opposed to other words in the sentence where no sounds were played). The 5 target words were always the 3rd, 5th, 7th, 9th, and 11th word in the sentence. Their mean length was 6.75 letters (SD = 1.89 letters, range: 3-13 letters). The average lexical frequency of the target words was 180 counts per million in the SUBTLEX-UK database (Van Heuven, Mandera, Keuleers, & Brysbaert, 2014; SD = 786, range: 0.06-15,450 counts per million). Short function words that are likely to be skipped during first-pass reading were not used as targets. Target words were always separated by one non-target word, which served as buffer to increase the sound inter-stimulus interval. In addition, there were always at least two words following the last target word in a sentence to avoid artefacts due to sentence wrap-up effects.

Figure 1.

An illustration of the sound presentation in the experiment. (a) An example sentence and the position of the target words. (b) The gaze-contingent sound presentation. The sound was played once the eye moved to the right of each invisible boundary (illustrated here as red dashed lines). Because the reading stimuli were specifically developed for this experiment and were not part of a pre-existing corpus, a norming study was carried out to determine their difficulty and naturalness. Twelve participants (nine female, mean age = 29.1 years) who did not take part in the eye-tracking experiment were asked to read each sentence and rate it on a scale from 1 to 10 based on how difficult and how natural they thought it was (1 = very difficult/ very unnatural; 10 = very easy/ very natural). The results showed that the sentences were rated both as very natural (M = 8.84, SD = 1.53) and very easy to understand (M = 9.29, SD = 1.23) by speakers of English. There were two experimental blocks: one block of 40 sentences that was completed in silence, and another block of 80 sentences that contained the gaze-contingent sound presentation. The silence block was added to determine whether the presentation of five gaze-contingent standard sounds leads to a change in reading behaviour compared to reading in silence. This was important as previous studies have not presented multiple gaze-contingent sounds in a reading task. The gaze-contingent block contained 2 types of trials: (1) 40 trials contained five standard sounds and (2) 40 trials contained four standard sounds and one deviant sound. In trials with a deviant sound, the first sound (on Target Word 1) was also always a standard sound. This was done to re-activate the representation of the standard sound at the beginning of the trial. The deviant sound was then presented on one of the four remaining target words (two through five) with equal probability across the experiment. Overall, deviant sounds represented 10% of all sounds in the experiment. The assignment of conditions to sentences, the position of the deviant sound, and the order of the two experimental blocks were counter-balanced with a full Latin square design. The sentences within each block appeared in random order. The gaze-contingent block always started with three standard-only trials to establish the sine wave as the standard sound.

Apparatus

Participants’ eye movements were recorded with an EyeLink 1000 Tower Mount at a sampling frequency of 1,000 Hz. The resolution noise was <0.01° and the velocity noise was <0.5° on average. Participants rested their chin on a headrest to minimise head-movement artefacts. Viewing was binocular, but only the right eye was recorded. The experiment was programmed in Matlab R2014a (MathWorks, 2014) with the Psychophysics toolbox (Brainard, 1997; Pelli, 1997) and Eyelink (Cornelissen, Peters, & Palmer, 2002) libraries. The sentences were presented on a Cambridge Research Systems LCD ++ monitor (screen resolution: 1920 × 1080 pixels; refresh rate: 100 Hz). The sentences were formatted in a Courier New 18pt. font and appeared as black text over white background on a single line in the middle of the screen. The width of each letter was 14 pixels. The distance between the eye and the monitor was 80 cm. At this distance, each letter subtended a visual angle of approximately 0.34°. The sounds were played through a Creative Labs Sound Blaster X-Fi SB0770 sound card using the low-latency presentation mode of the Psychophysics toolbox. The sound stimuli were presented binaurally at 65 dB(A) through Bose QuietComfort 25 noise-cancelling headphones. The experiment was performed on a personal computer (PC) running Windows 7.

Procedure

Participants were tested individually in a 30- to 40-min session. Participants were instructed to ignore the task-irrelevant sounds and focus on the reading task. Before the experiment, participants were calibrated on a three-point calibration grid. A drift check was presented before each trial and participants were re-calibrated whenever necessary. The calibration error was kept at <0.3° across the experiment. The experiment started with six silent practice trials. Each trial began with a black gaze box that appeared at 50 pixels on the left side of the screen. Upon fixating this box for 100 ms, it disappeared and the sentence was presented on the screen, with the first letter appearing in the middle of the location where the box was. The gaze-contingent sound manipulation is illustrated in Figure 1b. An invisible boundary (Rayner, 1975) was placed at the first pixel of the empty space before each of the five target words. Once the eye crossed a boundary, the command to play the sound was sent. The delay between sending this command and the actual onset of the sound through the headphones was 14 ms. However, because the boundary was usually crossed during a saccade towards the target word, the sound onset delay relative to fixation onset was normally much less (mean delay = 4.8 ms, SD = 8.9 ms). To avoid sound overlap when participants made a long saccade and crossed more than one boundary, a sound was played only when at least 10 ms had passed since the offset of the previous sound. Each sound was played only once when the target word was first fixated; the sound was not repeated if the target word was subsequently re-fixated during a regression. Participants pressed a button on the mouse to terminate each trial. Comprehension of the sentences was assessed with a “Yes/No” question after 33% of the trials. For example, in the sentence “Cathy was nervous about giving the short speech before the panel of judges.” the comprehension question was “Was Cathy relaxed about giving her speech? Yes/No.”

Data analysis

We analysed fixations on Target Words 2 to 5 as a function of the sound condition (silence, standard, deviant). To ensure that the comparisons were equivalent with respect to the words’ position, one target word was sampled per trial in the silence and standard sound conditions. This word was selected based on the design matrix used to allocate the deviant sounds to word positions in trials where a deviant sound was presented. This made it possible to form a fully balanced dataset with an equal number of observations per sound condition and target word position. We analysed four standard fixation duration measures: first fixation duration (FFD; the duration of the first fixation on a word), single fixation duration (SFD; the duration when a word is fixated only once), gaze duration (GD; the sum of all fixations on a word before the eyes move on to another word), and total viewing time (TVT; the sum of all fixations on a word, including the ones made during a regression). In addition, a few measures of global reading were also analysed: sentence reading time, fixation duration, number of fixations, and saccade length. The data were analysed with (Generalised) linear mixed models ([G]LMMs) using the lme4 package v.1.1-12 (Bates, Machler, Bolker, & Walker, 2014) in R 3.3.0 (R Core Team, 2016). Sound type was entered as a fixed effect in the model. Random intercepts and random slopes for sound type were added for both participants and items[2] (Baayen, Davidson, & Bates, 2008; Barr, Levy, Scheepers, & Tily, 2013). Fixation durations were log transformed in all analyses. Treatment contrast coding was used with the standard sound as the baseline. This contrast coding made it possible to do the two key comparisons in the experiment: (1) silence versus standard sound—to determine whether the presentation of gaze-contingent sounds in general influenced reading behaviour compared to silent reading and (2) deviant versus standard sound—to test for the presence of a sound deviance effect. The results were considered statistically significant if the|t| or|z| values were ⩾1.96. Standardised effect sizes in Cohen’s d (Borenstein, 2009) are also reported for the significant results.

Results

The average comprehension accuracy was 95%, thus indicating that participants understood the sentences (Silence: M = 95.1%, SD = 21.7%; Standard: M = 94.7%, SD = 22.4%; Deviant: M = 95.2%, SD = 21.3%). There were no differences in comprehension accuracy across the sound conditions (all|z|s < 0.49). After the experiment, only two participants (4.1%) reported some awareness that the sounds were played depending on their eye movements.[3] Trials with blinks (6.2%) and trials in which the command to play the sound was sent after the fixation onset of the target word (12.2%) were excluded from the data. In addition, trials with boundary “hooks” (8.5%) were also excluded. A hook occurs when the gaze crosses the invisible boundary and triggers the sound, but then returns to the left of the boundary and lands on a previous word. Fixations shorter than 80 ms that occurred within one letter of another fixation were combined with that fixation. Trials with fixation durations longer than 800 ms for FFD, 2,000 ms for GD, and 4,000 ms for TVT (0.44%) were excluded as outliers in all analyses. This left 72.6% of the data for analysis (a total of 4,206 trials).

Global reading

The descriptive statistics and linear mixed model (LMM) results for global reading measures are shown in Tables 1 and 2, respectively. No significant differences were observed between the sound conditions on any of these measures: deviant sounds did not affect performance relative to the standard condition, and performance was comparable in the silence and standard conditions. Therefore, global reading was not disrupted by deviant sounds or the presence of gaze-contingent sounds in general.

Table 1.

Mean descriptive statistics for global reading measures.

Sound	Sentence reading time (in ms)	Fixation duration (in ms)	Number of fixations	Saccade length (in letters)
Silence	3,670 (1,781)	237 (114)	15.4 (6.25)	9.76 (9.44)
Standard	3,580 (1,715)	237 (109)	15.1 (5.81)	9.59 (9.04)
Deviant	3,560 (1,697)	236 (110)	15.1 (5.83)	9.51 (8.64)

Standard deviations in parenthesis.

Table 2.

Results from LMMs for global reading measures.

Effect	Sentence reading time[a]			Fixation duration			Number of fixations			Saccade length[a]
Effect	b	SE	t	b	SE	t	b	SE	t	b	SE	t
Intercept	8.09	0.04	185	5.38	0.02	282.5	15.08	0.56	26.75	2.03	0.02	95.82
Deviant vs. standard	<–0.01	0.01	−0.57	<–0.01	<0.01	−1.24	−0.01	0.13	−0.08	<–0.01	0.01	−0.57
Silence vs. standard	0.02	0.03	0.81	<0.01	<0.01	0.09	0.35	0.42	0.83	0.01	0.01	0.87

Statistically significant t-values are formatted in bold.

LMMs = linear mixed models; SE = standard error.

The random intercept for subjects was removed due to convergence failure.

Mean descriptive statistics for global reading measures. Standard deviations in parenthesis. Results from LMMs for global reading measures. Statistically significant t-values are formatted in bold. LMMs = linear mixed models; SE = standard error. The random intercept for subjects was removed due to convergence failure.

Target words

As Table 3 shows, fixation durations were significantly longer immediately following the deviant sound compared to the standard sound (FFD: b = 0.03, SE = 0.02, t = 2.02, d = 0.20; SFD: b = 0.04, SE = 0.02, t = 2.43, d = 0.29; GD: b = 0.05, SE = 0.02, t = 2.8, d = 0.21; TVT: b = 0.04, SE = 0.02, t = 2.16, d = 0.18). However, fixation durations were comparable in the standard and silent conditions (FFD: b = 0.02, SE = 0.01, t = 1.35; SFD: b = 0.03, SE = 0.02, t = 1.86; GD: b = 0.01, SE = 0.02, t = 0.2; TVT: b = 0.01, SE = 0.02, t = 0.55). When the position of the target word in the sentence was added as a fixed effect in the model, there were no significant interactions with the contrast between the standard and the deviant sound (all|t|s ⩽ 1.75). This shows that the deviance effect was not modulated by the position of the target word in the sentence. Furthermore, there was a 3% greater probability of making a regressive saccade immediately after hearing the deviant sound compared to hearing the standard sound (b = 0.25, SE = 0.13, z = 1.96, d = 0.25; Deviant: M = 21.3%, SD = 40.9%; Standard: M = 17.9%, SD = 38.3%). No such difference was observed between the standard sound and the silence condition (b = 0.14, SE = 0.12, z = 1.23; Standard: M = 17.9%, SD = 38.3%; Silence: M = 19.2%, SD = 39.4%). These results clearly indicate that the deviant sound significantly and selectively affected fixation durations on the target word immediately after presentation. However, this effect did not appear in the global reading measures because they are averages across all words in the sentence whereas the deviant sound manipulation was specific to fixating a single word.

Table 3.

Mean fixation durations on the target words.

Sound type	FFD	SFD	GD	TVT
Silence	246 (83)	250 (83)	296 (137)	362 (222)
Standard	243 (85)	243 (83)	301 (160)	356 (222)
Deviant	254 (102)	255 (98)	317 (166)	375 (232)

FFD = first fixation duration; SFD = single fixation duration; GD = gaze duration; TVT = total viewing time.

Standard deviations in parenthesis.

Mean fixation durations on the target words. FFD = first fixation duration; SFD = single fixation duration; GD = gaze duration; TVT = total viewing time. Standard deviations in parenthesis. Post-hoc analyses were carried out to test if the disruption by the deviant sound also affected fixation durations on the subsequent word in the sentence (i.e., the post-target word). The results (presented in the Supplementary Material) indicated that the deviant sound had no effect on fixation durations on the next word in the sentence, which suggests that the disruption occurred before the next word was fixated. In addition, the magnitude of the deviance effect on the target words was not modulated by whether participants were slow or fast readers (see the Supplementary Material). To establish the point in time where the deviant sound first starts to affect fixation durations, we used the confidence interval divergence point analysis (CI-DPA; Reingold & Sheridan, 2014, 2018). This survival analysis technique can determine the earliest point in time where the distributions of two experimental conditions begin to diverge (i.e., significantly differ from one another). The CI-DPA analysis was run with 10,000 bootstrap iterations on the FFD of the target words using the method described in Reingold and Sheridan (2018). The analysis indicated that the deviant sound first started to affect fixation durations at 180 ms (95% CI = [167, 198]). This is illustrated in Figure 2.

Figure 2.

Survival curves of the first fixation during which participants heard the sound. The divergence point (at 180 ms) is shown by the vertical orange line.

Survival curves of the first fixation during which participants heard the sound. The divergence point (at 180 ms) is shown by the vertical orange line. Finally, the lexical frequency of target words was entered into a model with the fixation durations in the three sound conditions. If the deviant sound interfered with the lexical processing of the target words, we would expect to see an interaction between lexical frequency and the deviant condition. The results showed no significant interactions between lexical frequency and the deviant sound (see Table 4), suggesting that the deviant sound did not interfere with accessing the lexical representation of words.

Table 4.

Interactions between fixation durations on the target words and corpus lexical frequency.

Effect	FFD			SFD			GD
	b	SE	t	b	SE	t	b	SE	t
Intercept	5.44	0.02	254.29	5.45	0.02	251.45	5.60	0.03	191.26
Frequency	−0.02	0.01	–2.55	−0.03	0.01	–3.50	−0.05	0.01	–4.99
Deviant	0.03	0.02	2.02	0.04	0.02	2.37	0.05	0.02	2.86
Standard	0.02	0.01	1.31	0.03	0.01	1.86	< 0.01	0.02	0.18
Frequency: Deviant	< 0.01	0.01	0.36	0.02	0.01	1.52	0.02	0.02	1.27
Frequency: Standard	−0.01	0.01	−0.83	−0.01	0.01	−0.39	−0.01	0.02	−0.35

FFD = first fixation duration; SFD = single fixation duration; GD = gaze duration; SE = standard error.

Statistically significant t-values are formatted in bold. Lexical frequency was log transformed.

Interactions between fixation durations on the target words and corpus lexical frequency. FFD = first fixation duration; SFD = single fixation duration; GD = gaze duration; SE = standard error. Statistically significant t-values are formatted in bold. Lexical frequency was log transformed.

Discussion

The present results showed clear evidence of deviance distraction in eye movements during reading. Indeed, fixation durations on the target words were longer after hearing the deviant sound compared to hearing the standard sound. Interestingly, global reading measures were unaffected by either sound—the mere presence of gaze-contingent sounds did not influence reading behaviour. However, deviant sounds prolonged fixation durations on the currently read word at the time of the sound’s presentation. As comprehension accuracy was not affected by the presentation of a deviant sound in the sentence, it is unlikely that the observed longer fixation durations on the target words are related to comprehension difficulties. This is in line with previous studies demonstrating that irrelevant speech can disrupt fixation durations during reading without an associated disruption in comprehension (Cauchard et al., 2012; Hyönä & Ekholm, 2016; Yan et al., 2018). The time-course analysis of deviance distraction and the absence of modulation by lexical frequency enabled us to further localise the source of the effect. For example, in the E-Z Reader model of eye-movement control during reading (Reichle, Pollatsek, Fisher, & Rayner, 1998; Reichle, Rayner, & Pollatsek, 2003; Reichle, Warren, & McConnell, 2009), word processing starts with an early visual processing stage during which the visual features of the word are propagated from the retina to the brain. This stage is then followed by two lexical-processing stages: familiarity check (L1) and lexical access (L2). In this model, completion of L1 initiates the programming of the next saccade because the next stage (L2) is likely to be completed soon (Reichle et al., 1998). The programming of the next saccade also happens in two stages: a labile stage (M1) during which the saccade programme can be cancelled and a non-labile stage (M2) during which the saccade programme can no longer be cancelled. Our data suggest that the deviant sound did not interfere with the lexical-processing stages for a few reasons. First, the CI-DPA analysis indicated that the earliest discernible effect of the deviant sound occurs somewhat late, at 180 ms. This exceeds the temporal estimates of lexical processing reported in the neurophysiological literature (127-172 ms on average; Reichle & Reingold, 2013). Second, Reingold and Sheridan (2014) used the CI-DPA analysis to estimate that the lexical frequency effect (i.e., the difference in fixation times between high- and low-frequency words) starts at 138 ms after fixation onset, that is, some 42 ms earlier than the sound deviance effect. Finally, deviance distraction was not modulated by target-word lexical frequency. Taken together, these findings suggest that deviant sounds did not increase fixation durations on the target words because of delayed lexical processing. As the deviance effect appears to occur after the lexical-processing stages of reading, we hypothesise that the interference is likely due to saccadic inhibition during the programming of the next saccade. This conclusion is generally consistent with Wessel and Aron’s (2013) proposition of a general action suppression upon the presentation of a deviant sound. In their study, action inhibition took the form of a reduced corticospinal excitability of the hand following transcranial magnetic stimulation (TMS) of the corresponding motor cortex some 150 ms after the deviant sound’s onset. Because the programming and execution of saccades involves subcortical structures, such as the superior colliculus, cerebellum, and the brainstem (Munoz, 2002), our results in fact extend Wessel and Aron’s (2013) by suggesting that deviant sounds may also inhibit subcortical brain areas. Similar evidence for saccadic inhibition was also found in a study where a sequence of auditory distractors (which also included a deviant) was presented in a picture viewing task (Graupner, Velichkovsky, Pannasch, & Marx, 2007). Graupner et al. reported that the deviant distractor reduced the proportion of terminated fixations, first at around 90 ms and then at around 150 ms, a finding they interpreted as “first” and “second” saccadic inhibition. The secondary inhibition is consistent with both the present findings and Wessel and Aron’s (2013) results. Further evidence in support of this saccadic inhibition account is the finding that the deviant effect was not modulated by the position of the target word in the sentence. Because inhibition of the oculomotor system should be independent of any underlying word identification or syntactic processes, the effect should not vary with respect to where it occurs in the sentence. Even though deviant sounds had no effect on spatial measures of saccades (e.g., saccade length), this is not inconsistent with the saccadic inhibition explanation because the hypothesised inhibition occurs during the planning stages rather than during the execution of the next saccade. This is in line with previous evidence showing that transient visual changes can result in saccadic inhibition some 60 to 70 ms after the onset of the display change (Reingold & Stampe, 1999, 2002, 2004). This saccadic inhibition explanation is in line with the notion that deviant sounds capture attention away from ongoing processing (e.g., Escera et al., 1998; Schröger, 1996) and inhibit motor processes (Wessel & Aron, 2013). The present experiment contributes to our understanding of the time course of this effect by suggesting that a covert orienting of attention to the deviant sound does not occur during the lexical-processing stages of the fixated word but at a subsequent stage (namely, the preparation of the next saccade). In oddball tasks, the effect of deviant sounds on electrophysiological measures of the orienting response is traceable from about 150 to 600 ms from the sound’s onset (Berti & Schröger, 2003; Escera et al., 1998). So far, the temporal dynamics of the motor inhibition yielded by deviant sounds remain to be established, but early evidence places it at around 150 ms after the sound’s onset. Our experiment does not allow us to disentangle the potential contributions of the orienting response and the motor inhibition. Exploring this issue could be an objective for future research. For example, the precise time course of the orienting response in our task could be studied by using electroencephalogram (EEG) and eye-tracking co-registration (e.g., Dimigen, Sommer, Hohlfeld, Jacobs, & Kliegl, 2011; Plöchl, Ossandón, & König, 2012), capitalising on the well-known ERP signature of the orienting response (i.e., the P3a component; Berti & Schröger, 2001) to study its potential correlation with the observed disruption in eye movements and to better understand its influence on the reading process. It should be noted that the size of the deviance-distraction effect in the present experiment was relatively small: the numerical effect size was 14.5 ms on average and the standardised effect size was d = 0.22 on average. While the numerical effect size was generally comparable to that of previous studies using the auditory–visual cross-modal task (mean effect size = 17 ms), the standardised effect size was almost three times smaller (mean d = 0.65 in previous oddball experiments; Andrés, Parmentier, & Escera, 2006; Escera et al., 1998; Ljungberg, Parmentier, Leiva, & Vega, 2012; Parmentier, 2016; Parmentier et al., 2008; Parmentier, Elsley, & Ljungberg, 2010; Parmentier, Vasilev & Andrés, 2018; Wetzel, Schröger, & Widmann, 2013). This discrepancy between the numerical and standardised effect size is most likely due to the greater variability in eye-movement responses compared with behavioural reaction time responses. We speculate that our reading task may introduce more variability into the data because, unlike categorisation tasks, it does not require a specific response from participants upon the presentation of task-irrelevant sounds. Nevertheless, it may be possible to increase the magnitude of the effect in future studies. For example, this could be done by using novel sounds that are not repeated throughout the experiment (Berti, 2012; Escera et al., 1998; Wetzel, Schröger, & Widmann, 2016; cf. Wetzel et al., 2013) or by studying children or elderly populations who can show greater distraction (Andrés et al., 2006; Gumenyuk et al., 2001; Gumenyuk, Korzyukov, Alho, Escera, & Näätänen, 2004; Leiva, Andrés, Servera, Verbruggen, & Parmentier, 2016; Leiva et al., 2015a; Wetzel, Scharf, & Widmann, 2018; Wetzel et al., 2016; cf. Leiva et al., 2016). Traditionally, the majority of research on deviance distraction has used categorisation tasks, such as judging the parity of numbers presented on the screen (e.g., Parmentier et al., 2008) or judging the duration of the irrelevant sound (short vs. long; e.g., Schröger & Wolff, 1998b). Critically, the present experiment differs from such studies in that it did not require any response from participants after the presentation of task-irrelevant sounds. Rather, participants simply needed to do well overall on the task (i.e., comprehend the sentences) and ignore the sounds. In contrast, participants in categorisation tasks typically need to make a binary response after each sound. As a result, the sound may serve as an unspecific warning signal that alerts them to the imminent presentation of a target stimulus that requires a response (Parmentier, 2014). While deviance distraction in oddball tasks occurs even when the sound does not fulfil this unspecific warning function (Parmentier, 2016), the evidence for this again comes from studies that required a response on at least some of the trials (typically on half of them; see Ljungberg et al., 2012; Parmentier et al., 2010; Wetzel et al., 2013). Therefore, the present experiment contributes to our theoretical understanding of deviance distraction by showing that deviant sounds disrupt ongoing processes even when the sounds are not relevant in any way to the main task and do not constitute cues for goal-directed behaviour. More broadly, the present results are also relevant for understanding how the external auditory environment can influence readers’ eye-movement behaviour. Recent work has shown that certain sounds, such as meaningful speech, can result in attentional distraction that is detectible at the level of eye fixations whereas other sounds, such as background music or unintelligible speech, generally have no influence on eye-movement behaviour (Cauchard et al., 2012; Hyönä & Ekholm, 2016; Johansson et al., 2012; Yan et al., 2018). These results suggest that certain auditory environments can interfere with readers’ ability to maintain sustained attention on the task and process the text in an efficient way. The present experiment builds upon these findings by demonstrating that deviant sounds that violate readers’ expectations can also result in an immediate disruption of eye-movement behaviour. However, unlike continuous distractors such as meaningful speech that typically result in an increase in re-reading behaviour (e.g., Hyönä & Ekholm, 2016; Yan et al., 2018), deviance distraction may occur due to motor inhibition that interferes with the programming of the next saccade. Another interesting finding in the present experiment was that the standard sound did not lead to significantly longer fixation durations compared with silence. This is in contrast to some evidence showing that the presentation of a short auditory distractor similar to the standard sound used in this study can lead to an increase in fixation durations (e.g., Pannasch, Dornhoefer, Unema, & Velichkovsky, 2001; Pannasch & Velichkovsky, 2009, Experiment 3; but see Reingold & Stampe, 2004, Experiment 1). The lack of difference may have occurred because the standard sound was not long enough for it to cause any meaningful distraction. In addition, the standard sound was presented very frequently and in quick succession, which may have helped participants habituate to it. This is consistent with previous evidence showing that participants can habituate to distracting sounds, such as irrelevant speech (e.g., Banbury & Berry, 1997; Bell, Röer, Dentale, & Buchner, 2012; Morris & Jones, 1990). One limitation of this study was that it did not experimentally manipulate the psycholinguistic properties of target words, such as their lexical frequency or predictability given the preceding sentence context. Therefore, future studies should include a lexical frequency or a predictability manipulation on the target words to replicate and extend the present results. If the sound deviance effect is due to inhibition during the programming of the next saccade, it should fail to interact with these variables and it should lead to an increase in fixation durations that is independent of the cognitive processing of the word. The present data also do not allow us to determine with certainty the specific component of saccade programming that may be affected. For example, both the E-Z Reader (Reichle et al., 1998) and SWIFT (Engbert, Nuthmann, Richter, & Kliegl, 2005) models assume that saccade programming occurs in a labile (i.e., cancellable) and a non-labile (i.e., non-cancellable) stage. At present, it is not known which of the two stages may be affected. However, if this effect is due to a general motor inhibition (see Wessel & Aron, 2013), it can be speculated that the resulting inhibition should be generally similar regardless of the exact saccade programming stage in which it occurs. In summary, the present experiment introduced a new method to study deviance distraction during reading by utilising a gaze-contingent presentation of task-irrelevant sounds (e.g., Inhoff et al., 2002). The results showed that deviant sounds lead to an increase in fixation durations, which is most likely due to saccadic inhibition. This finding contributes to our growing understanding of how task-irrelevant sounds influence eye-movement control during reading by showing that unexpected sounds can have an immediate effect on the programming of the next saccade. More broadly, this study also raises the possibility that unexpected sounds may inhibit ongoing motor processes in everyday tasks similar to reading that rely on relatively automatic motor control. Click here for additional data file. Supplemental material, Supplementary_Materials_rev for Distraction by deviant sounds during reading: An eye-movement study by Martin R Vasilev, Fabrice BR Parmentier, Bernhard Angele and Julie A Kirkby in Quarterly Journal of Experimental Psychology

81 in total

Review 1. Involuntary attention and distractibility as evaluated with event-related brain potentials.

Authors: C Escera; K Alho; E Schröger; I Winkler
Journal: Audiol Neurootol Date: 2000 May-Aug Impact factor: 1.854

2. Auditory distraction: event-related potential and behavioral indices.

Authors: E Schröger; M H Giard; C Wolff
Journal: Clin Neurophysiol Date: 2000-08 Impact factor: 3.708

3. The omnipresent prolongation of visual fixations: saccades are inhibited by changes in situation and in subject's activity.

Authors: S Pannasch; S M Dornhoefer; P J Unema; B M Velichkovsky
Journal: Vision Res Date: 2001 Impact factor: 1.886

4. Saccadic inhibition in voluntary and reflexive saccades.

Authors: Eyal M Reingold; Dave M Stampe
Journal: J Cogn Neurosci Date: 2002-04-01 Impact factor: 3.225

Review 5. Commentary: saccadic eye movements: overview of neural circuitry.

Authors: Douglas P Munoz
Journal: Prog Brain Res Date: 2002 Impact factor: 2.453

6. A contingent speech technique in eye movement research on reading.

Authors: Albrecht W Inhoff; Cynthia Connine; Ralph Radach
Journal: Behav Res Methods Instrum Comput Date: 2002-11

7. The Eyelink Toolbox: eye tracking with MATLAB and the Psychophysics Toolbox.

Authors: Frans W Cornelissen; Enno M Peters; John Palmer
Journal: Behav Res Methods Instrum Comput Date: 2002-11

8. A comparison of auditory and visual distraction effects: behavioral and event-related indices.

Authors: S Berti; E Schröger
Journal: Brain Res Cogn Brain Res Date: 2001-01

9. Brain activity index of distractibility in normal school-age children.

Authors: V Gumenyuk; O Korzyukov; K Alho; C Escera; E Schröger; R J Ilmoniemi; R Näätänen
Journal: Neurosci Lett Date: 2001-11-16 Impact factor: 3.046

10. Working memory controls involuntary attention switching: evidence from an auditory distraction paradigm.

Authors: Stefan Berti; Erich Schröger
Journal: Eur J Neurosci Date: 2003-03 Impact factor: 3.386

3 in total

1. Distraction by auditory novelty during reading: Evidence for disruption in saccade planning, but not saccade execution.

Authors: Martin R Vasilev; Fabrice Br Parmentier; Julie A Kirkby
Journal: Q J Exp Psychol (Hove) Date: 2021-01-12 Impact factor: 2.143

2. Distraction by violation of sensory predictions: Functional distinction between deviant sounds and unexpected silences.

Authors: Fabrice B R Parmentier; Alicia Leiva; Pilar Andrés; Murray T Maybery
Journal: PLoS One Date: 2022-09-06 Impact factor: 3.752

3. Distraction by deviant sounds: disgusting and neutral words capture attention to the same extent.

Authors: Fabrice B R Parmentier; Isabel Fraga; Alicia Leiva; Pilar Ferré
Journal: Psychol Res Date: 2019-05-03

3 in total