| Literature DB >> 36263363 |
Lingli Cheng1,2,3, Lili Gao4, Xumeng Zhang2,5, Zuheng Wu6, Jiaxue Zhu1,3, Zhaoan Yu1,3, Yue Yang1,3, Yanting Ding2,5, Chao Li1,3, Fangduo Zhu2,5, Guangjian Wu2,5, Keji Zhou2,5, Ming Wang2,5, Tuo Shi4, Qi Liu1,2,5.
Abstract
Cochleas are the basis for biology to process and recognize speech information, emulating which with electronic devices helps us construct high-efficient intelligent voice systems. Memristor provides novel physics for performing neuromorphic engineering beyond complementary metal-oxide-semiconductor technology. This work presents an artificial cochlea based on the shallen-key filter model configured with memristors, in which one filter emulates one channel. We first fabricate a memristor with the TiN/HfOx/TaOx/TiN structure to implement such a cochlea and demonstrate the non-volatile multilevel states through electrical operations. Then, we build the shallen-key filter circuit and experimentally demonstrate the frequency-selection function of cochlea's five channels, whose central frequency is determined by the memristor's resistance. To further demonstrate the feasibility of the cochlea for system applications, we use it to extract the speech signal features and then combine it with a convolutional neural network to recognize the Free Spoken Digit Dataset. The recognition accuracy reaches 92% with 64 channels, compatible with the traditional 64 Fourier transform transformation points of mel-frequency cepstral coefficients method with 95% recognition accuracy. This work provides a novel strategy for building cochleas, which has a great potential to conduct configurable, high-parallel, and high-efficient auditory systems for neuromorphic robots.Entities:
Keywords: cochlea; configurable; filter; memristor; speech recognition
Year: 2022 PMID: 36263363 PMCID: PMC9574047 DOI: 10.3389/fnins.2022.982850
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 5.152
FIGURE 1Biological vs. bioinspired cochlear auditory recognition system. (A) Schematic of speech recognition in the biological cochlea system. (B) Speech signal recognition system using the memristor-based artificial cochleas.
FIGURE 2Device structure and electrical properties. (A) Film-stacked structure of 1T1R, consisting of a transistor and TiN/HfOx /TaOx /TiN with a TEM image. (B) Low panel quantifies the atomic profile of primary elements across the memristor from the EDS line scan upper panel. (C) The I-V characteristics of the 1T1R in 100 repeated DC sweeps during the set/reset processes. For the set process, the scan voltage of 0–1.7 V is applied to the TE with the voltage of 1.5 V applied to the gate; for the reset process, the scan voltage of 0–2.2 V is applied to the SE with the voltage of 4 V applied to the gate. (D) The I-V electrical characteristics of the device under pulsed scanning with set/reset process. During the set process, a pulse (2.2 V, 100 ns) is applied to TE terminal with Vg = 4 V; during the reset process, a pulse (4 V,100 ns) is applied to SE terminal with Vg = 4 V. Endurance results show the reliable HRS and LRS up to 5 × 105 cycles. (E) Multilevel resistance programming characteristic of the device under DC sweep. Vg increases from 1 to 2.5 V with a 0.05 V step. The inset shows the good linearity of the memristor under 0–0.1 V sweeping on TE. (F) Multi-resistance stability retention characteristics of the device.
FIGURE 3Bioinspired cochlea filter circuit and experimental results. (A) Circuit structure of bioinspired cochlea filter circuit based on memristor, where R1 = 1 MΩ, R2 = 100 MΩ, C1 = C2 = 40 pF. (B) The output response characteristics when the sinusoidal signal (0.2 V, 1,500 Hz) input to the circuit with the 44 kΩ memristor’s resistance. (C) Output signals when the input sinusoidal signal’s frequency increases from 1,000 to 3,400 Hz. (D) The amplitude-frequency characteristic curve of the memristor-based circuit when the memristor is programmed to 44 kΩ. (E) Multiple amplitude-frequency characteristic curves of the memristor-based filter circuit when the memristor is programmed to 86, 70, 44, 32, 26.7 kΩ, respectively. (F) Comparison diagram of the relationship between f0 and extracted from experimental and simulation results.
FIGURE 4Zero to nine digital audio recognition realized in the artificial cochlear system based on CNN neural network. (A) Illustration of the artificial cochlea speech recognition system. (B) Energy spectrum of digital 0 speech signal after feature extraction by 64-channel parallel filter circuits. (C) Schematic diagram of CNN speech signal recognition. (D) Network simulation flow chart. (E) Experimental and simulated recognition accuracy of 10 digital speech audio recognition under 32 and 64 channels.