Philémon Roussel1,2, Gaël Le Godais1,2,3, Florent Bocquelet1,2, Marie Palma1,2, Jiang Hongjie4, Shaomin Zhang5, Anne-Lise Giraud6, Pierre Mégevand6,7, Kai Miller8, Johannes Gehrig9, Christian Kell9, Philippe Kahane10, Stéphan Chabardés11, Blaise Yvert1,2. 1. Inserm, BrainTech Lab, U1205, Grenoble, France. 2. University Grenoble Alpes, BrainTech Lab, U1205, Grenoble, France. 3. University Grenoble Alpes, Gipsa-Lab, Grenoble, France. 4. Zhejiang University, Department of Neurosurgery, Hangzhou, People's Republic of China. 5. Zhejiang University, Qiushi Academy for Advanced Studies, Hangzhou, People's Republic of China. 6. University of Geneva, Faculty of Medicine, Department of Basic Neuroscience, Geneva, Switzerland. 7. Geneva University Hospitals, Division of Neurology, Geneva, Switzerland. 8. Mayo Clinic, Department of Neurosurgery, Rochester, MN, United States of America. 9. Goethe University Frankfurt, Department of Neurology and Center for Personalized Translational Epilepsy Research, Frankfurt, Germany. 10. CHU Grenoble Alpes, Department of Neurology, Grenoble, France. 11. CHU Grenoble Alpes, Department of Neurosurgery, Grenoble, France.
Abstract
OBJECTIVE: A current challenge of neurotechnologies is to develop speech brain-computer interfaces aiming at restoring communication in people unable to speak. To achieve a proof of concept of such system, neural activity of patients implanted for clinical reasons can be recorded while they speak. Using such simultaneously recorded audio and neural data, decoders can be built to predict speech features using features extracted from brain signals. A typical neural feature is the spectral power of field potentials in the high-gamma frequency band, which happens to overlap the frequency range of speech acoustic signals, especially the fundamental frequency of the voice. Here, we analyzed human electrocorticographic and intracortical recordings during speech production and perception as well as a rat microelectrocorticographic recording during sound perception. We observed that several datasets, recorded with different recording setups, contained spectrotemporal features highly correlated with those of the sound produced by or delivered to the participants, especially within the high-gamma band and above, strongly suggesting a contamination of electrophysiological recordings by the sound signal. This study investigated the presence of acoustic contamination and its possible source. APPROACH: We developed analysis methods and a statistical criterion to objectively assess the presence or absence of contamination-specific correlations, which we used to screen several datasets from five centers worldwide. MAIN RESULTS: Not all but several datasets, recorded in a variety of conditions, showed significant evidence of acoustic contamination. Three out of five centers were concerned by the phenomenon. In a recording showing high contamination, the use of high-gamma band features dramatically facilitated the performance of linear decoding of acoustic speech features, while such improvement was very limited for another recording showing no significant contamination. Further analysis and in vitro replication suggest that the contamination is caused by the mechanical action of the sound waves onto the cables and connectors along the recording chain, transforming sound vibrations into an undesired electrical noise affecting the biopotential measurements. SIGNIFICANCE: Although this study does not per se question the presence of speech-relevant physiological information in the high-gamma range and above (multiunit activity), it alerts on the fact that acoustic contamination of neural signals should be proofed and eliminated before investigating the cortical dynamics of these processes. To this end, we make available a toolbox implementing the proposed statistical approach to quickly assess the extent of contamination in an electrophysiological recording (https://doi.org/10.5281/zenodo.3929296).
OBJECTIVE: A current challenge of neurotechnologies is to develop speech brain-computer interfaces aiming at restoring communication in people unable to speak. To achieve a proof of concept of such system, neural activity of patients implanted for clinical reasons can be recorded while they speak. Using such simultaneously recorded audio and neural data, decoders can be built to predict speech features using features extracted from brain signals. A typical neural feature is the spectral power of field potentials in the high-gamma frequency band, which happens to overlap the frequency range of speech acoustic signals, especially the fundamental frequency of the voice. Here, we analyzed human electrocorticographic and intracortical recordings during speech production and perception as well as a rat microelectrocorticographic recording during sound perception. We observed that several datasets, recorded with different recording setups, contained spectrotemporal features highly correlated with those of the sound produced by or delivered to the participants, especially within the high-gamma band and above, strongly suggesting a contamination of electrophysiological recordings by the sound signal. This study investigated the presence of acoustic contamination and its possible source. APPROACH: We developed analysis methods and a statistical criterion to objectively assess the presence or absence of contamination-specific correlations, which we used to screen several datasets from five centers worldwide. MAIN RESULTS: Not all but several datasets, recorded in a variety of conditions, showed significant evidence of acoustic contamination. Three out of five centers were concerned by the phenomenon. In a recording showing high contamination, the use of high-gamma band features dramatically facilitated the performance of linear decoding of acoustic speech features, while such improvement was very limited for another recording showing no significant contamination. Further analysis and in vitro replication suggest that the contamination is caused by the mechanical action of the sound waves onto the cables and connectors along the recording chain, transforming sound vibrations into an undesired electrical noise affecting the biopotential measurements. SIGNIFICANCE: Although this study does not per se question the presence of speech-relevant physiological information in the high-gamma range and above (multiunit activity), it alerts on the fact that acoustic contamination of neural signals should be proofed and eliminated before investigating the cortical dynamics of these processes. To this end, we make available a toolbox implementing the proposed statistical approach to quickly assess the extent of contamination in an electrophysiological recording (https://doi.org/10.5281/zenodo.3929296).
Authors: Guy H Wilson; Sergey D Stavisky; Francis R Willett; Donald T Avansino; Jessica N Kelemen; Leigh R Hochberg; Jaimie M Henderson; Shaul Druckmann; Krishna V Shenoy Journal: J Neural Eng Date: 2020-11-25 Impact factor: 5.379
Authors: Timothée Proix; Jaime Delgado Saa; Andy Christen; Stephanie Martin; Brian N Pasley; Robert T Knight; Xing Tian; David Poeppel; Werner K Doyle; Orrin Devinsky; Luc H Arnal; Pierre Mégevand; Anne-Lise Giraud Journal: Nat Commun Date: 2022-01-10 Impact factor: 17.694
Authors: Miguel Angrick; Maarten C Ottenhoff; Lorenz Diener; Darius Ivucic; Gabriel Ivucic; Sophocles Goulis; Jeremy Saal; Albert J Colon; Louis Wagner; Dean J Krusienski; Pieter L Kubben; Tanja Schultz; Christian Herff Journal: Commun Biol Date: 2021-09-23
Authors: Alan Bush; Anna Chrabaszcz; Victoria Peterson; Varun Saravanan; Christina Dastolfo-Hromack; Witold J Lipski; R Mark Richardson Journal: Neuroimage Date: 2022-02-02 Impact factor: 6.556
Authors: Maxime Verwoert; Maarten C Ottenhoff; Sophocles Goulis; Albert J Colon; Louis Wagner; Simon Tousseyn; Johannes P van Dijk; Pieter L Kubben; Christian Herff Journal: Sci Data Date: 2022-07-22 Impact factor: 8.501