Literature DB >> 29363603

The eardrums move when the eyes move: A multisensory effect on the mechanics of hearing.

Kurtis G Gruters1,2,3, David L K Murphy1,2,3, Cole D Jenson1,2,3, David W Smith4, Christopher A Shera5,6, Jennifer M Groh7,2,3.   

Abstract

Interactions between sensory pathways such as the visual and auditory systems are known to occur in the brain, but where they first occur is uncertain. Here, we show a multimodal interaction evident at the eardrum. Ear canal microphone measurements in humans (n = 19 ears in 16 subjects) and monkeys (n = 5 ears in three subjects) performing a saccadic eye movement task to visual targets indicated that the eardrum moves in conjunction with the eye movement. The eardrum motion was oscillatory and began as early as 10 ms before saccade onset in humans or with saccade onset in monkeys. These eardrum movements, which we dub eye movement-related eardrum oscillations (EMREOs), occurred in the absence of a sound stimulus. The amplitude and phase of the EMREOs depended on the direction and horizontal amplitude of the saccade. They lasted throughout the saccade and well into subsequent periods of steady fixation. We discuss the possibility that the mechanisms underlying EMREOs create eye movement-related binaural cues that may aid the brain in evaluating the relationship between visual and auditory stimulus locations as the eyes move.
Copyright © 2018 the Author(s). Published by PNAS.

Entities:  

Keywords:  EMREO; middle ear muscles; otoacoustic emissions; reference frame; saccade

Mesh:

Year:  2018        PMID: 29363603      PMCID: PMC5819440          DOI: 10.1073/pnas.1717948115

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


Visual information can aid hearing, such as when lip reading cues facilitate speech comprehension. To derive such benefits, the brain must first link visual and auditory signals that arise from common locations in space. In species with mobile eyes (e.g., humans, monkeys), visual and auditory spatial cues bear no fixed relationship to one another but change dramatically and frequently as the eyes move, about three times per second over an 80° range of space. Accordingly, considerable effort has been devoted to determining where and how the brain incorporates information about eye movements into the visual and auditory processing streams (1). In the primate brain, all of the regions previously evaluated have shown some evidence that eye movements modulate auditory processing [inferior colliculus (2–6), auditory cortex (7–9), parietal cortex (10–12), and superior colliculus (13–17)]. Such findings raise the question of where in the auditory pathway eye movements first impact auditory processing. In this study, we tested whether eye movements affect processing in the auditory periphery. The auditory periphery possesses at least two means of tailoring its processing in response to descending neural control (Fig. 1). First, the middle ear muscles (MEMs), the stapedius and tensor tympani, attach to the ossicles that connect the eardrum to the oval window of the cochlea. Contraction of these muscles tugs on the ossicular chain, modulating middle ear sound transmission and moving the eardrum. Second, within the cochlea, the outer hair cells (OHCs) are mechanically active and modify the motion of both the basilar membrane and, through mechanical coupling via the ossicles, the eardrum [i.e., otoacoustic emissions (OAEs)]. In short, the actions of the MEMs and OHCs affect not only the response to incoming sound but also transmit vibrations backward to the eardrum. Both the MEMs and OHCs are subject to descending control by signals from the central nervous system (reviewed in refs. 18–20), allowing the brain to adjust the cochlear encoding of sound in response to previous or ongoing sounds in either ear and based on global factors, such as attention (21–27). The collective action of these systems can be measured in real time with a microphone placed in the ear canal (28). We used this technique to study whether the brain sends signals to the auditory periphery concerning eye movements, the critical information needed to reconcile the auditory spatial and visual spatial worlds.
Fig. 1.

Motile cochlear OHCs expand and contract in a way that depends both on the incoming sound and on the descending input received from the superior olivary complex in the brain. OHC motion moves the basilar membrane, and subsequently the eardrum, via fluid/mechanical coupling of these membranes through the ossicular chain. The MEMs also pull on the ossicles, directly moving the eardrum. These muscles are innervated by motor neurons near the facial and trigeminal nerve nuclei, which receive input from the superior olive bilaterally. In either case, eardrum motion can be measured with a microphone in the ear canal.

Motile cochlear OHCs expand and contract in a way that depends both on the incoming sound and on the descending input received from the superior olivary complex in the brain. OHC motion moves the basilar membrane, and subsequently the eardrum, via fluid/mechanical coupling of these membranes through the ossicular chain. The MEMs also pull on the ossicles, directly moving the eardrum. These muscles are innervated by motor neurons near the facial and trigeminal nerve nuclei, which receive input from the superior olive bilaterally. In either case, eardrum motion can be measured with a microphone in the ear canal.

Results

The Eardrums Move with Saccades.

Sixteen humans executed saccades to visual targets varying in horizontal position (Fig. 2). Half of the trials were silent, whereas the other half incorporated a series of task-irrelevant clicks presented before, during, and at two time points after the saccades (Fig. 2). This allowed us to compare the effects of eye movement-related neural signals on the auditory periphery in silence as well as in the presence of sounds often used to elicit OAEs.
Fig. 2.

Experimental design and results. (A) Recordings for all subjects were made via a microphone (Mic) in the ear canal set into a custom-fit ear bud. On each trial, a subject fixated on a central LED and then made a saccade to a target LED (−24° to +24° horizontally in 6° increments and 6° above the fixation point) without moving his/her head. The ±24° locations were included on only 4.5% of trials and were excluded from analysis (); other target locations were equally likely (∼13% frequency). (B) Humans (black text) received randomly interleaved silent and click trials (50% each). Clicks were played via a sound transducer coupled with the microphone at four times during these trials: during the initial fixation and saccade and at 100 ms and 200 ms after target fixation. Monkeys’ trials had minor timing differences (red text), and all trials had one click at 200–270 ms after target fixation (red click trace). (C) Average eye trajectories for one human subject and session for each of the included targets are shown; colors indicate saccade target locations from ipsilateral (blue) to contralateral (red). deg, degrees; Horiz., horizontal; Vert., vertical. Mean eye position is shown as a function of time, aligned on saccade onset (D) and offset (E). H, horizontal; V, vertical. Mean microphone recordings of air pressure in the ear canal, aligned on saccade onset (F) and offset (G), indicate that the eardrum oscillates in conjunction with eye movements. The phase and amplitude of the oscillation varied with saccade direction and amplitude, respectively. The oscillations were, on average, larger when aligned on saccade onset than when aligned on saccade offset.

Experimental design and results. (A) Recordings for all subjects were made via a microphone (Mic) in the ear canal set into a custom-fit ear bud. On each trial, a subject fixated on a central LED and then made a saccade to a target LED (−24° to +24° horizontally in 6° increments and 6° above the fixation point) without moving his/her head. The ±24° locations were included on only 4.5% of trials and were excluded from analysis (); other target locations were equally likely (∼13% frequency). (B) Humans (black text) received randomly interleaved silent and click trials (50% each). Clicks were played via a sound transducer coupled with the microphone at four times during these trials: during the initial fixation and saccade and at 100 ms and 200 ms after target fixation. Monkeys’ trials had minor timing differences (red text), and all trials had one click at 200–270 ms after target fixation (red click trace). (C) Average eye trajectories for one human subject and session for each of the included targets are shown; colors indicate saccade target locations from ipsilateral (blue) to contralateral (red). deg, degrees; Horiz., horizontal; Vert., vertical. Mean eye position is shown as a function of time, aligned on saccade onset (D) and offset (E). H, horizontal; V, vertical. Mean microphone recordings of air pressure in the ear canal, aligned on saccade onset (F) and offset (G), indicate that the eardrum oscillates in conjunction with eye movements. The phase and amplitude of the oscillation varied with saccade direction and amplitude, respectively. The oscillations were, on average, larger when aligned on saccade onset than when aligned on saccade offset. We found that the eardrum moved when the eyes moved, even in the absence of any externally delivered sounds. Fig. 2 shows the average eye position as a function of time for each target location for the human subjects on trials with no sound stimulus, aligned with respect to (w.r.t.) saccade onset (Fig. 2) and offset (Fig. 2). The corresponding average microphone voltages are similarly aligned and color-coded for saccade direction and amplitude (Fig. 2 ). The microphone readings oscillated, time-locked to both saccade onset and offset with a phase that depended on saccade direction. When the eyes moved toward a visual target contralateral to the ear being recorded, the microphone voltage deflected positively, indicating a change in ear canal pressure, beginning about 10 ms before eye movement onset. This was followed by a more substantial negative deflection at about 5 ms after the onset of the eye movement, after which additional oscillatory cycles occurred. The period of the oscillation is typically about 30 ms (∼33 Hz). For saccades in the opposite (ipsilateral) direction, the microphone signal followed a similar pattern but in the opposite direction: The initial deflection of the oscillation was negative. The amplitude of the oscillations appears to vary with the amplitude of the saccades, with larger saccades associated with larger peaks than those occurring for smaller saccades. Comparison of the traces aligned on saccade onset vs. saccade offset reveals that the oscillations continue into at least the initial portion of the period of steady fixation that followed saccade offset. The eye movement-related eardrum oscillation (EMREO) observed following saccade offset was similar in form to that observed during the saccade itself when the microphone traces were aligned on saccade onset, maintaining their phase and magnitude dependence on saccade direction and length. The fact that this postsaccadic continuation of the EMREO is not seen when the traces are aligned on saccade onset suggests a systematic relationship between the offset of the movement and the phase of the EMREO, such that variation in saccade duration obscures the ongoing nature of the EMREO. To obtain a portrait of the statistical significance of the relationship between the direction and amplitude of the eye movements and the observed microphone measurements of ear canal pressure across time, we calculated a regression of microphone voltage vs. saccade target location for each 0.04-ms sample from 25 ms before to 100 ms after saccade onset. The regression was conducted separately for each individual subject. Since this involved many repeated statistical tests, we compared the real results with a Monte Carlo simulation in which we ran the same analysis but scrambled the relationship between each trial and its true saccade target location (details are provided in ). As shown in Fig. 3, the slope of the regression involving the real data (red trace, averaged across subjects) frequently deviates from zero, indicating a relationship between saccade amplitude and direction and microphone voltage. The value of the slope oscillates during the saccade period, beginning 9 ms before and continuing until about 60 ms after saccade onset, matching the oscillations evident in the data in Fig. 2. In contrast, the scrambled data trace (Fig. 3, gray) is flat during this period. Similarly, the effect size (R2 values) of the real data deviates from the scrambled baseline in a similar but slightly longer time frame, dropping back to the scrambled data baseline at 75–100 ms after saccade onset (Fig. 3). Fig. 3 shows the percentage of subjects showing a statistically significant (P < 0.05) effect of saccade target location at each time point. This curve reaches 80–100% during the peaks of the oscillations observed at the population level. A similar, although weaker, dependence of the EMREO on target location was observed when the recordings were synchronized to saccade offset (Fig. 3 ). We repeated this test in an additional dataset (human dataset II; ) involving finer grained sampling within a hemifield (i.e., analyzing target locations within the contralateral or ipsilateral hemifield). This analysis confirmed the relationship between the microphone voltage and saccade amplitude, as well as saccade direction (Fig. S1).
Fig. 3.

Regression results for data aligned to saccade onset (A, C, and E) and offset (B, D, and F). (A and B) Mean ± SEM slope of regression of microphone voltage vs. saccade target location (conducted separately for each subject and then averaged across the group) at each time point for real (red) vs. scrambled (gray) data. In the scrambled Monte Carlo analysis, the true saccade target locations were shuffled and arbitrarily assigned to the microphone traces for individual trials. (C and D) Proportion of variance (Var.) accounted for by regression fit (R2). (E and F) Percentage of subject ears showing P < 0.05 for the corresponding time point. Additional details are provided in .

Regression results for data aligned to saccade onset (A, C, and E) and offset (B, D, and F). (A and B) Mean ± SEM slope of regression of microphone voltage vs. saccade target location (conducted separately for each subject and then averaged across the group) at each time point for real (red) vs. scrambled (gray) data. In the scrambled Monte Carlo analysis, the true saccade target locations were shuffled and arbitrarily assigned to the microphone traces for individual trials. (C and D) Proportion of variance (Var.) accounted for by regression fit (R2). (E and F) Percentage of subject ears showing P < 0.05 for the corresponding time point. Additional details are provided in .

EMREO Phase Is Related to Relative, Not Absolute, Saccade Direction.

Comparison of the EMREOs in subjects who had both ears recorded confirmed that the direction of the eye movement relative to the recorded ear determines the phase pattern of the EMREO. Fig. 4 shows the results for one such subject. When the left ear was recorded, leftward eye movements were associated with a positive voltage peak at saccade onset (Fig. 4, blue traces), whereas when the right ear was recorded, that same pattern occurred for rightward eye movements (Fig. 4, blue traces). Translating this into eardrum motion, the implication is that for a given moment during a given saccade, the eardrums bulge inward in one ear while bulging outward in the other ear. Which ear does what when is determined by the direction of the eye movement relative to the recorded ear (contralaterality/ipsilaterality), not whether it is directed to the left vs. right in space. The other subjects tested with both ears showed similar patterns, and those tested with one ear were also generally consistent (although there were some individual idiosyncrasies in timing and amplitude; the remaining individual subject data are shown in Fig. S2).
Fig. 4.

Recordings in left (A) and right (B) ears in an individual human subject. The eye movement-related signals were similar in the two ears when saccade direction is defined with respect to the recorded ear. The remaining individual subjects’ data can be found in Fig. S2. Contra, contralateral; Ipsi, ipsilateral; Mic, microphone.

Recordings in left (A) and right (B) ears in an individual human subject. The eye movement-related signals were similar in the two ears when saccade direction is defined with respect to the recorded ear. The remaining individual subjects’ data can be found in Fig. S2. Contra, contralateral; Ipsi, ipsilateral; Mic, microphone.

Relating the EMREO Signal to Eardrum Displacement.

At the low frequencies characteristic of EMREO oscillations (∼30 Hz), mean eardrum displacement is directly proportional to the pressure produced in the small volume enclosed between the microphone and the eardrum (∼1 cc for humans). We converted the EMREO voltage signal to pressure using the microphone calibration curve. At frequencies below about 200 Hz, the nominal microphone sensitivity quoted by the manufacturer does not apply, and so we used the complex frequency response measured by Christensen et al. (29) (details are provided in ). We found that the group average EMREO had peak-to-peak pressure changes near ∼42 mPa for 18° contralateral saccades, as well as an initial phase opposite to the microphone voltage (Fig. 5, maximum excursion of dark red traces). Using ear canal dimensions typical of the adult ear (30), this pressure, the equivalent of about 57-dB peak-equivalent sound pressure level (SPL), corresponds to a mean peak-to-peak eardrum displacement of ∼4 nm. Thus, the inferred initial deflection of the eardrum was always opposite to the direction of the saccade: When the eyes moved left, the eardrums moved right, and vice versa.
Fig. 5.

Estimated EMREO pressure for human subjects at saccade onset (A and C) and saccade offset (B and D). Pressures were obtained from the measured microphone voltage using the microphone’s complex frequency response measured at low frequencies as described by Christensen et al. (29). Contra, contralateral; deg, degrees; Horiz., horizontal; Ipsi, ipsilateral.

Estimated EMREO pressure for human subjects at saccade onset (A and C) and saccade offset (B and D). Pressures were obtained from the measured microphone voltage using the microphone’s complex frequency response measured at low frequencies as described by Christensen et al. (29). Contra, contralateral; deg, degrees; Horiz., horizontal; Ipsi, ipsilateral.

EMREOs in Monkeys.

As noted earlier, eye movements are known to affect neural processing in several areas of the auditory pathway (2–9). Because this previous work was conducted using nonhuman primate subjects, we sought to determine whether EMREOs also occur in monkeys. We tested the ears of rhesus monkeys (Macaca mulatta; n = 5 ears in three monkeys) performing a paradigm involving a trial type that was a hybrid of those used for the humans. The hybrid trial type was silent until after the eye movement, after which a single click was delivered with a 200- to 270-ms variable delay. EMREOs were observed with timing and waveforms roughly similar to those observed in the human subjects, both at the population level (Fig. 6 ) and at the individual subject level (Fig. 6, individual traces are shown in Fig. S3). The time-wise regression analysis suggests that the monkey EMREO begins at about the time of the saccade and reaches a robust level about 8 ms later (Fig. 4 ).
Fig. 6.

Eye position (A), microphone signal of ear canal pressure (B), and results of point-by-point regression (C–E) for monkey subjects (n = 5 ears). All analyses were calculated in the same way as for the human data (Figs. 2 and 3). Contra, contralateral; deg, degrees; Ipsi, ipsilateral.

Eye position (A), microphone signal of ear canal pressure (B), and results of point-by-point regression (C–E) for monkey subjects (n = 5 ears). All analyses were calculated in the same way as for the human data (Figs. 2 and 3). Contra, contralateral; deg, degrees; Ipsi, ipsilateral.

Controls for Electrical Artifact.

Control experiments ruled out electrical artifacts as possible sources of these observations. In particular, we were concerned that the microphone’s circuitry could have acted as an antenna and could have been influenced by eye movement-related electrical signals in either the eye movement measurement system in monkeys (scleral eye coils) and/or some other component of the experimental environment: electrooculographic signals resulting from the electrical dipole of the eye or myogenic artifacts, such as electromyographic signals, originating from the extraocular, facial, or auricular muscles. If such artifacts contributed to our measurements, they should have continued to do so when the microphone was acoustically plugged without additional electrical shielding. Accordingly, we selected four subjects with prominent effects of eye movements on the microphone signal (Fig. 7) and repeated the test while the acoustic port of the microphone was plugged (Fig. 7). The eye movement-related effects were no longer evident when the microphone was acoustically occluded (Fig. 7). This shows that eye movement-related changes in the microphone signal stem from its capacity to measure acoustic signals rather than electrical artifacts. Additionally, EMREOs were not observed when the microphone was placed in a 1-mL syringe (the approximate volume of the ear canal) and positioned directly behind the pinna of one human subject (Fig. 7). In this configuration, any electrical contamination should be similar to that in the regular experiment. We saw none, supporting the interpretation that the regular ear canal microphone measurements are detecting eardrum motion and not electrical artifact.
Fig. 7.

EMREOs recorded normally (A) are not observed when the microphone input port is plugged to eliminate acoustic but not electrical (B) contributions to the microphone signal. The plugged microphone sessions were run as normal sessions except that after calibration, the microphone was placed in a closed earbud before continuing with the session (n = 4 subjects). (C) Similarly, EMREOs were not evident when the microphone was placed in a test tube behind a human subject’s pinna while he/she performed the experiment as normal. Contra, contralateral; Ipsi, ipsilateral.

EMREOs recorded normally (A) are not observed when the microphone input port is plugged to eliminate acoustic but not electrical (B) contributions to the microphone signal. The plugged microphone sessions were run as normal sessions except that after calibration, the microphone was placed in a closed earbud before continuing with the session (n = 4 subjects). (C) Similarly, EMREOs were not evident when the microphone was placed in a test tube behind a human subject’s pinna while he/she performed the experiment as normal. Contra, contralateral; Ipsi, ipsilateral.

EMREOs and Click Stimuli.

EMREOs occurred not only in silence but also when sounds were presented. During half of our trials for human subjects, acoustic clicks (65-dB peak-equivalent SPL) were delivered during the initial fixation, during the saccade to the target, and at both 100 ms and 200 ms after obtaining target fixation. Clicks presented during the saccade superimposed on the ongoing EMREO (Fig. 8). Subtraction of the clicks measured during the fixation period (Fig. 8) indicated that the EMREO was not appreciably changed by the presence of the click (Fig. 8). Unlike saccade onset or offset, the click did not appear to reset the phase of the EMREO.
Fig. 8.

Clicks do not appear to alter EMREOs. (Upper Row) Mean microphone signals for human subjects (A–D) and monkeys (E) for click presentations at various time points are shown: during the initial fixation (A), during the saccade (B, ∼20 ms after saccade onset), and 100 ms (C) and 200 ms (D) after target fixation was obtained in humans, and 200–270 ms after target fixation in monkeys (E). (Insets) Zoomed-in views of the periclick timing for a more detailed view (gray backgrounds). (Middle, B–D) Residual microphone signal after the first click in each trial was subtracted (human subjects) is shown. There were no obvious distortions in the EMREOs at the time of the (removed) click, suggesting that the effects of EMREOs interact linearly with incoming sounds. (Lower, A–E) Mean peak-to-peak amplitude of the clicks (mean ± SE) is shown. There were no clear differences in the peak click amplitudes for any epochs, indicating that the acoustic impedance did not change as a function of saccade target location. Contra, contralateral; Ipsi, ipsilateral.

Clicks do not appear to alter EMREOs. (Upper Row) Mean microphone signals for human subjects (A–D) and monkeys (E) for click presentations at various time points are shown: during the initial fixation (A), during the saccade (B, ∼20 ms after saccade onset), and 100 ms (C) and 200 ms (D) after target fixation was obtained in humans, and 200–270 ms after target fixation in monkeys (E). (Insets) Zoomed-in views of the periclick timing for a more detailed view (gray backgrounds). (Middle, B–D) Residual microphone signal after the first click in each trial was subtracted (human subjects) is shown. There were no obvious distortions in the EMREOs at the time of the (removed) click, suggesting that the effects of EMREOs interact linearly with incoming sounds. (Lower, A–E) Mean peak-to-peak amplitude of the clicks (mean ± SE) is shown. There were no clear differences in the peak click amplitudes for any epochs, indicating that the acoustic impedance did not change as a function of saccade target location. Contra, contralateral; Ipsi, ipsilateral. Clicks delivered after the eye movement was complete (Fig. 8 ) revealed no new sound-triggered effects attributable to the different static eye fixation positions achieved by the subjects by that point of the trial. Results in monkeys, for which only one postsaccadic click was delivered (at 75-dB peak-equivalent SPL), were similar (Fig. 8). Finally, we tested the peak-to-peak amplitude of the microphone signal corresponding to the click itself. Any changes in the recorded amplitude of a click despite the same voltage having been applied to the earbud speaker would indicate changes to the acoustic impedance of the ear as a function of saccade direction. We calculated the peak-to-peak amplitude of the click as a function of saccade target location, but found no clear evidence of any saccade target location-related differences during any of the temporal epochs (Fig. 8 ). If the mechanisms causing the eardrum to move produce concomitant changes in the dynamic stiffness of the eardrum, they are too small to be detected with this technique.

Discussion

Here, we demonstrated a regular and predictable pattern of oscillatory movements of the eardrum associated with movements of the eyes. When the eyes move left, both eardrums initially move right and then oscillate for three to four cycles, with another one to two cycles occurring after the eye movement is complete. Eye movements in the opposite direction produce an oscillatory pattern in the opposite direction. This EMREO is present in both human and nonhuman primates and appears similar across both ears within individual subjects. The impact of the eye movement-related process revealed here on subsequent auditory processing remains to be determined. In the case of OAEs, the emissions themselves are easily overlooked, near-threshold sounds in the ear canal. However, the cochlear mechanisms that give rise to them have a profound impact on the sensitivity and dynamic range of hearing (31, 32). OAEs thus serve as a biomarker for the health of this important mechanism. EMREOs may constitute a similar biomarker indicating that information about the direction, amplitude, and time of occurrence of saccadic eye movements has reached the periphery and that such information is affecting underlying mechanical processes that may not themselves be readily observed. EMREO assessment may therefore have value in understanding auditory and visual-auditory deficits that contribute to disorders ranging from language impairments to autism and schizophrenia, as well as multisensory deficits associated with traumatic brain injuries. One important unresolved question is whether the oscillatory nature of the effect observed here reflects the most important aspects of the phenomenon. It may be that the oscillations reflect changes to internal structures within the ear that persist statically until the eye moves again. If so, then the impact on auditory processing would not be limited to the period of time when the eardrum is actually oscillating. Instead, processing could be affected at all times, updating to a new state with each eye movement. If the oscillations themselves are the most critical element of the phenomenon, and auditory processing is only affected during these oscillations, then it becomes important to determine for what proportion of the time they occur. In our paradigm, we were able to demonstrate oscillations occurring over a total period of ∼110 ms, based on the aggregate results from aligning on saccade onset and offset. They may continue longer than that, but with a phase that changes in relation to some unmeasured or uncontrolled variable. As time passes after the eye movement, the phase on individual trials might drift such that the oscillation disappears from the across-trial average. However, with the eyes moving about three times per second under normal conditions, an EMREO duration of about 110 ms for each saccade corresponds to about one-third of the time in total. Regardless, evidence arguing for a possible influence on auditory processing can be seen in the size of EMREOs. A head-to-head comparison of EMREOs with OAEs is difficult since auditory detection thresholds are higher in the frequency range of EMREOs than in the range of OAEs, but EMREOs appear to be at least comparable and possibly larger than OAEs. For 18° horizontal saccades, EMREOs produce a maximum peak-equivalent pressure of about 57-dB SPL, whereas click-evoked OAEs range from ∼0- to 20-dB SPL in healthy ears. The similar or greater scale of EMREOs supports the interpretation that they too likely reflect underlying mechanisms of sufficient strength to make a meaningful contribution to auditory processing. In whatever fashion EMREOs, or their underlying mechanism, contribute to auditory processing, we surmise that the effect concerns the localization of sounds with respect to the visual scene or using eye movements. Determining whether a sight and a sound arise from a common spatial position requires knowing the relationship between the visual eye-centered and auditory head-centered reference frames. Eye position acts as a conversion factor from eye- to head-centered coordinates (1), and eye movement-related signals have been identified in multiple auditory-responsive brain regions (2–17). The direction/phase and amplitude of EMREOs contain information about the direction and amplitude of the accompanying eye movements, which situates them well for playing a causal role in this process. Note that we view the EMREO as most likely contributing to normal and accurate sound localization behavior. Sounds can be accurately localized using eye movements regardless of initial eye position (33), and this is true even for brief sounds presented when the eyes are in flight (34). Although some studies have found subtle influences of eye position on sound perception tasks (35–42), such effects are only consistent when prolonged fixation is involved (43–45). Holding the eyes steady for a seconds to minutes represents a substantial departure from the time frame in which eye movements normally occur. It is possible that deficiencies in the EMREO system under such circumstances contribute to these inaccuracies. The source of the signals that cause EMREOs is not presently known. Because the EMREOs precede, or occur simultaneously, with actual movement onset, it appears likely that they derive from a copy of the motor command to generate the eye movement rather than a proprioceptive signal from the orbits, which would necessarily lag the actual eye movement. This centrally generated signal must then affect the activity of either MEMs or OHCs, or a combination of both. Based on similarities between our recordings and known physiological activity of the MEMs, we presume that the mechanism behind this phenomenon is most likely the MEMs. Specifically, the frequency (around 20–40 Hz) is similar to oscillations observed in previous recordings of both the tensor tympani and stapedius muscles (46–51). Furthermore, it seems unlikely, given the measured fall-off in reverse middle ear transmission at low frequencies (52), that OHC activity could produce ear canal sounds of the magnitude observed (42-mPa peak-to-peak or 57-dB peak-equivalent SPL). Although MEMs are often associated with bilaterally attenuating loud environmental and self-generated sounds (which their activity may precede), they are also known to be active in the absence of explicit auditory stimuli, particularly during rapid eye movement sleep (53–56) and nonacoustic startle reflexes (51, 57–61), and have been found to exhibit activity associated with movements of the head and neck in awake cats (57, 62). (This latter observation is worthy of further investigation; perhaps there is also a head movement-related eardrum oscillation to facilitate conversion of auditory information into a body-centered reference frame). More generally, we have demonstrated that multisensory interactions occur at the most peripheral possible point in the auditory system and that this interaction is both systematic and substantial. This observation builds on studies showing that attention, either auditory-guided or visually guided, can also modulate the auditory periphery (21–27). Our findings also raise the intriguing possibility that efferent pathways in other sensory systems [e.g., those leading to the retina (63–73)] also carry multisensory information to help refine peripheral processing. This study contributes to an emerging body of evidence suggesting that the brain is best viewed as a dynamic system in which top-down signals modulate feed-forward signals; that is, the brain integrates top down information early in sensory processing to make the best-informed decision about the world with which it has to interact.

Methods

The data in this study are available at https://figshare.com.

Human Subjects and Experimental Paradigm.

Human dataset I.

Human subjects (n = 16, eight females, aged 18–45 y; participants included university students as well as young adults from the local community) were involved in this study. All procedures involving human subjects were approved by the Duke University Institutional Review Board. Subjects had apparently normal hearing and normal or corrected vision. Informed consent was obtained from all participants before testing, and all subjects received monetary compensation for participation. Stimulus (visual and auditory) presentation, data collection, and offline analysis were run on custom software utilizing multiple interfaces [behavioral interface and visual stimulus presentation: Beethoven software (eye position sampling rate: 500 Hz), Ryklin, Inc.; auditory stimulus presentation and data acquisition: Tucker Davis Technologies (microphone sampling rate: 24.441 kHz); and data storage and analysis: MATLAB; MathWorks]. Subjects were seated in a dark, sound-attenuating room. Head movements were minimized using a chin rest, and eye movements were tracked with an infrared camera (EyeLink 1000 Plus). Subjects performed a simple saccade task (Fig. 2 ). The subject initiated each trial by obtaining fixation on an LED located at 0° in azimuth and elevation and about 2 m away. After 200 ms of fixation, the central LED was extinguished and a target LED located 6° above the horizontal meridian and ranging from −24° to 24°, in intervals of 6°, was illuminated. The most eccentric targets (±24°) were included despite being slightly beyond the range at which head movements normally accompany eye movements, which is typically 20° (74, 75), because we found in preliminary testing that including these locations improved performance for the next most eccentric locations (i.e., the ±18° targets). However, because these (±24°) locations were difficult for subjects to perform well, we presented them on only 4.5% of the trials (in comparison to 13% for the other locations) and excluded them from analysis. After subjects made saccades to the target LED, they maintained fixation on it (9° window diameter) for 250 ms until the end of the trial. If fixation was dropped (i.e., if the eyes traveled outside of the 9° window at any point throughout the initial or target fixation period), the trial was terminated and the next trial began. On half of the trials, task-irrelevant sounds were presented via the earphones of an earphone/microphone assembly (Etymotic 10B+ microphone with ER 1 headphone driver) placed in the ear canal and held in position through a custom-molded silicone earplug (Radians, Inc.). These sounds were acoustic clicks at 65-dB peak-equivalent SPL, produced by brief electrical pulses (40-μs positive monophasic pulse) and were presented at four time points within each trial: during the initial fixation period (100 ms after obtaining fixation), during the saccade (∼20 ms after initiating an eye movement), 100 ms after obtaining fixation on the target, and 200 ms after obtaining fixation on the target. Acoustic signals from the ear canal were recorded via the in-ear microphone throughout all trials, and were recorded from one ear in 13 subjects (left/right counterbalanced) and from both ears in separate sessions in the other three subjects, for a total of n = 19 ears tested. Testing for each subject ear was conducted in two sessions over two consecutive days or within the same day but separated by a break of at least 1 h between sessions. Each session involved about 600 trials and lasted a total of about 30 min. The sound delivery and microphone system was calibrated at the beginning of every session using a custom script (MATLAB) and again after every block of 200 trials. The calibration routine played a nominal 80-dB SPL sound (a click, a broadband burst, and a series of tones at frequencies ranging from 1–12 kHz in 22 steps) in the ear canal and recorded the resultant sound pressure. It then calculated the difference between the requested and measured sound pressures and calculated a gain adjustment profile for all sounds tested. Auditory recording levels were set with a custom software calibration routine (MATLAB) at the beginning of each data collection block (200 trials). All conditions were randomly interleaved.

Human dataset II.

Additional data were collected from eight ears of four new subjects (aged 20–27 y, three females and one male, one session per subject) to verify that the relationship between microphone voltage and saccade amplitude/target location held up when sampled more finely within a hemifield. All procedures were the same as in the first dataset with three exceptions: (i) The horizontal target positions ranged from −20° to +20° in 4° increments (the vertical component was still 6°), (ii) the stimuli were presented from a color LCD monitor (70 cm × 49 cm) at a distance of 85 cm, and (iii) no clicks were played during trials. The microphone signal was also calibrated differently: Clicks played between trials revealed the stability of its impulse response and overall gain during and across experimental sessions. This helped identify decreasing battery power to the amplifiers, occlusion of the microphone barrel, or a change in microphone position. A Focusrite Scarlett 2i2 audio interface was used for auditory stimulus presentation and data acquisition (microphone sampling rate of 48 kHz). Stimulus presentation and data acquisition were controlled through custom MATLAB scripts using The Psychophysics Toolbox extension (76–78) and the Eyelink MATLAB toolbox (79). These data are presented in Fig. S1, and the remaining human figures are derived from dataset I.

Monkey Subjects and Experimental Paradigm.

All procedures conformed to the guidelines of the National Institutes of Health (NIH publication no. 86-23, revised 1985) and were approved by the Institutional Animal Care and Use Committee of Duke University. Monkey subjects (n = 3, all female) underwent aseptic surgical procedures under general anesthesia to implant a head post holder to restrain the head and a scleral search coil (Riverbend Eye Tracking System) to track eye movements (80, 81). After recovery with suitable analgesics and veterinary care, monkeys were trained in the saccade task described above for the human subjects. Two monkeys were tested with both ears in separate sessions, whereas the third monkey was tested with one ear, for a total of n = 5 ears tested. The trial structure was similar to that used in human dataset I but with the following differences (Fig. 2, red traces): (i) Eye tracking was done with a scleral eye coil; (ii) task-irrelevant sounds were presented on all trials, but only one click was presented at 200–270 ms after the saccade to the visual target (the exact time jittered in this range, and was therefore at the same time or slightly later than the timing of the fourth click for human subjects); (iii) the ±24° targets were presented in an equal proportion to the other target locations but were similarly excluded from analysis as above, and there were additional targets at ±9° that were also discarded as they were not used with the human subjects; (iv) initial fixation was 110–160 ms (jittered time range), while target fixation duration was 310–430 ms; (v) monkeys received a fluid reward for correct trial performance; (vi) disposable plastic ear buds containing the earphone/microphone assembly as above were placed in the ear canal for each session; (vii) auditory recording levels were set at the beginning of each data collection session (the ear bud was not removed during a session, and therefore no recalibration occurred within a session); (viii) sessions were not divided into blocks (the monkeys typically performed consistently throughout an entire session and dividing it into blocks was unnecessary); and (ix) the number of trials per session was different for monkeys vs. humans (which were always presented exactly 1,200 trials over the course of the entire study), and the actual number of trials performed varied based on monkey’s performance tolerance and capabilities for the day. Monkeys MNN012 and MYY002 were both run for four sessions per ear (no more than one session per day) over the course of 2 wk; MNN012 correctly performed an average of 889 of 957 trials at included target locations per session for both ears, while MYY002 correctly performed 582 of 918 trials per session on average. Monkey MHH003 was only recorded for 1 d and yielded 132 correct out of 463 trials. Despite this much lower performance, visual inspection of the data suggested they were consistent with other subject data and were therefore included for analysis. It is worth highlighting that the effect reported in this study can be seen, in this case, with comparatively few trials.

Control Sessions.

To verify that the apparent effects of eye movements on ear-generated sounds were genuinely acoustic in nature and did not reflect electrical contamination from sources such as myogenic potentials of the extraocular muscles or the changing orientation of the electrical dipole of the eyeball, we ran a series of additional control studies. Some subjects (n = 4 for plugged microphone control, n = 1 for syringe control) were invited back as participants in these control studies. In the first control experiment, the microphone was placed in the ear canal and subjects performed the task but the microphone’s input port was physically plugged, preventing it from detecting acoustic signals (Fig. 7). This was accomplished by placing the microphone in a custom ear mold in which the canal-side opening for the microphone was blocked. Thus, the microphone was in the same physical position during these sessions, and should therefore have continued to be affected by any electrical artifacts that might be present, but its acoustic input was greatly attenuated by the plug. Four subject ears were retested in this paradigm in a separate pair of data collection sessions from their initial “normal” sessions. The control sessions were handled exactly as the normal sessions except that the plugged ear mold was used to replace the regular open ear mold after the microphone was calibrated in the open ear. Calibration was executed exactly as in the normal sessions before plugging the ear mold. In the second control, trials were run exactly as a normal session except that the microphone was set into a 1-mL syringe [approximately the average volume of the human ear canal (82)] using a plastic ear bud and the syringe was placed on top of the subject’s ear behind the pinna. Acoustic recordings were taken from within the syringe while a human subject executed the behavioral paradigm exactly as normal. Sound levels were calibrated to the syringe at the start of each block.

Data Analysis.

Initial processing.

Unless specifically stated otherwise, all data are reported using the raw voltage recorded from the Etymotic microphone system. Results for individual human and monkey subjects were based on all of a subject’s correct and included trials (with ±24° and 9° target locations excluded, the average number of correct trials per human subject ear for each session was 150, for a total of 900 across the six sessions).

Trial exclusion criteria.

Trial exclusion criteria were based on saccade performance and microphone recording quality. For saccade performance, trials were excluded if: (i) the reaction time to initiate the saccade was >250 ms or (ii) the eyes deviated from the path between the fixation point and the target by more than 9°. These criteria resulted in the exclusion of 18.5 ± 11.1% per ear recorded. Microphone recording quality was examined for click and no-click trials separately, but followed the same guidelines. The mean and SD of microphone recordings over a whole block were calculated from all successful trials (after exclusions based on saccade performance). If the SD of the voltage values of a given trial was more than threefold the SD of the voltage values across the whole block, it was excluded. Sample-wise z-scores were calculated relative to the mean and SD of successful trials within a block. Trials with 50 samples or more with z-scores in excess of 10 were also excluded. These requirements removed trials containing noise contamination from bodily movements (e.g., swallowing, heavy breaths) or from external noise sources (acoustic and electric). These two criteria excluded an average of 1.7 ± 1.5% of trials that had passed the saccade exclusion criteria. Overall, 20% of trials were excluded because of saccade performance or recording quality.

Saccade-microphone synchronization.

Eye position data were resampled to the microphone sampling rate (from 500 Hz to 24.5 kHz) and smoothed to minimize the compounding error of nonmonotonic recording artifacts after each successive differentiation of eye position to calculate eye velocity, acceleration, and jerk (the time derivative of acceleration). Microphone data were synchronized to two time points in the saccades: initiation and termination. Saccade onsets were determined by locating the time of the first peak in the jerk of the saccades. These onset times for each trial were then used to synchronize the microphone recordings at the beginning of saccades. Because saccades of different lengths took different amounts of time to complete and because performance of saccades to identical targets varied between trials, saccade completions occurred over a range of tens of milliseconds. As a result, recordings of eardrum behavior related to the later stages of saccades would be poorly aligned if synchronized by saccade onset alone. Therefore, saccade termination, or offset, was determined by locating the second peak in the saccade jerk and used to synchronize microphone recordings with saccade completion.

Statistical analyses.

We evaluated the statistical significance of the effects of eye movements on signals recorded with the ear canal microphone with a regression analysis at each time point (microphone signal vs. saccade target location) for each subject. This produced a time series of statistical results (slope, R2, and P value; Fig. 3 ). This analysis was performed twice: first, with all trials synchronized to saccade onset and, second, with all trials synchronized to saccade offset. Given that the dependence of one sample relative to the next sample was unknown, a post hoc correction (e.g., Bonferroni correction) was not practical. Instead, we used a Monte Carlo technique to estimate the chance-related effect size and false-positive rates of this test. We first scrambled the relationship between each trial and its saccade target location assignment and then ran the same analysis as before. This provided an estimate of how our results should look if there was no relationship between eye movements and the acoustic recordings. Peak click amplitudes were calculated for each trial involving sounds by isolating the maximum and minimum peaks during the click stimuli. This allowed us to look for possible changes in acoustic impedance in the ear canal.

Estimation of eardrum motion.

The sensitivity of the microphone used in the experiments (Etymotic Research ER10B+) rolls off at frequencies below about 200 Hz, where it also introduces a frequency-dependent phase shift (29). Consequently, the microphone voltage waveform is not simply proportional to the ear canal pressure (or eardrum displacement) at frequencies characteristic of the EMREO waveform. To obtain the ear canal pressure, and estimate the eardrum motion that produced it, we converted the microphone voltage to pressure using published measurements of the microphone’s complex-valued frequency response (29). Because the measured frequency response (H, with units of volts per pascal) was sampled at 48 kHz, we first transformed it into the time-domain representation of the microphone’s impulse response, resampled the result at our sampling rate of 24.4 kHz, and then retransformed it back into the frequency domain as H. For each trial, the fast Fourier transform of microphone voltage (V) was divided by H and the inverse fast Fourier transform was then calculated to produce the estimate of ear-canal, P: Eardrum displacement, x, was computed from the measured pressure using the equation x = (V/A) P/ρ0c2, where V is the volume of the residual ear canal space (∼2 cc), A is the cross-sectional area of the eardrum (∼60 mm2), ρ0 is the density of air, and c is the speed of sound.
  74 in total

1.  NON-ACOUSTIC FACTORS INFLUENCING ACTIVITY OF MIDDLE EAR MUSCLES IN WAKING CATS.

Authors:  P W CARMEL; A STARR
Journal:  Nature       Date:  1964-04-11       Impact factor: 49.962

2.  Electromyographic studies of the middle ear muscles of the cat.

Authors:  S ELIASSON; L GISSELSSON
Journal:  Electroencephalogr Clin Neurophysiol       Date:  1955-08

3.  Timing and laminar profile of eye-position effects on auditory responses in primate auditory cortex.

Authors:  Kai-Ming G Fu; Ankoor S Shah; Monica N O'Connell; Tammy McGinnis; Haftan Eckholdt; Peter Lakatos; John Smiley; Charles E Schroeder
Journal:  J Neurophysiol       Date:  2004-07-28       Impact factor: 2.714

4.  Excitability of the acoustic m. stapedius and m. tensor tympani reflexes in the nonanesthetized rabbit.

Authors:  E Borg
Journal:  Acta Physiol Scand       Date:  1972-07

5.  The effects of histamine on rat and monkey retinal ganglion cells.

Authors:  Matthew J Gastinger; Rafail G Yusupov; Randolph D Glickman; David W Marshak
Journal:  Vis Neurosci       Date:  2004 Nov-Dec       Impact factor: 3.241

6.  Sleep deprivation and phasic activity of REM sleep: independence of middle-ear muscle activity from rapid eye movements.

Authors:  L De Gennaro; M Ferrara
Journal:  Sleep       Date:  2000-02-01       Impact factor: 5.849

7.  Retinopetal neurons located in the diencephalon of the Japanese monkey (Macaca fuscata).

Authors:  Abuduaini Abudureheman; Shiro Nakagawa
Journal:  Okajimas Folia Anat Jpn       Date:  2010-05

8.  Centrifugal fibers to the rat retina from the medial pretectal area and the periaqueductal grey matter.

Authors:  S K Itaya; P W Itaya
Journal:  Brain Res       Date:  1985-02-11       Impact factor: 3.252

9.  Histamine immunoreactive axons in the macaque retina.

Authors:  M J Gastinger; J J O'Brien; N B Larsen; D W Marshak
Journal:  Invest Ophthalmol Vis Sci       Date:  1999-02       Impact factor: 4.799

10.  Effect of eye position on saccades and neuronal responses to acoustic stimuli in the superior colliculus of the behaving cat.

Authors:  Luis C Populin; Daniel J Tollin; Tom C T Yin
Journal:  J Neurophysiol       Date:  2004-06-09       Impact factor: 2.714

View more
  7 in total

1.  Monkeys and humans implement causal inference to simultaneously localize auditory and visual stimuli.

Authors:  Jeff T Mohl; John M Pearson; Jennifer M Groh
Journal:  J Neurophysiol       Date:  2020-07-29       Impact factor: 2.714

Review 2.  Whistling While it Works: Spontaneous Otoacoustic Emissions and the Cochlear Amplifier.

Authors:  Christopher A Shera
Journal:  J Assoc Res Otolaryngol       Date:  2022-01-03

3.  Generalizability of clinically measured acoustic reflexes to brief sounds.

Authors:  Kristy K Deiters; Gregory A Flamme; Stephen M Tasko; William J Murphy; Nathaniel T Greene; Heath G Jones; William A Ahroon
Journal:  J Acoust Soc Am       Date:  2019-11       Impact factor: 1.840

4.  Time-domain and frequency-domain effects of tensor tympani contraction on middle ear sound transmission in gerbil.

Authors:  Liam Gallagher; Mohamed Diop; Elizabeth S Olson
Journal:  Hear Res       Date:  2021-04-08       Impact factor: 3.672

5.  Vestigial auriculomotor activity indicates the direction of auditory attention in humans.

Authors:  Daniel J Strauss; Farah I Corona-Strauss; Andreas Schroeer; Philipp Flotho; Ronny Hannemann; Steven A Hackley
Journal:  Elife       Date:  2020-07-03       Impact factor: 8.140

6.  Unity Assumption in Audiovisual Emotion Perception.

Authors:  Ka Lon Sou; Ashley Say; Hong Xu
Journal:  Front Neurosci       Date:  2022-03-04       Impact factor: 4.677

7.  Auditory Attention Reduced Ear-Canal Noise in Humans by Reducing Subject Motion, Not by Medial Olivocochlear Efferent Inhibition: Implications for Measuring Otoacoustic Emissions During a Behavioral Task.

Authors:  Nikolas A Francis; Wei Zhao; John J Guinan
Journal:  Front Syst Neurosci       Date:  2018-09-13
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.