| Literature DB >> 36234583 |
Anna N Matsukatova1,2, Aleksandr I Iliasov1,2, Kristina E Nikiruy1, Elena V Kukueva1, Aleksandr L Vasiliev1, Boris V Goncharov1, Aleksandr V Sitnikov1,3, Maxim L Zanaveskin1, Aleksandr S Bugaev4, Vyacheslav A Demin1, Vladimir V Rylkov1,5, Andrey V Emelyanov1,4.
Abstract
Convolutional neural networks (CNNs) have been widely used in image recognition and processing tasks. Memristor-based CNNs accumulate the advantages of emerging memristive devices, such as nanometer critical dimensions, low power consumption, and functional similarity to biological synapses. Most studies on memristor-based CNNs use either software models of memristors for simulation analysis or full hardware CNN realization. Here, we propose a hybrid CNN, consisting of a hardware fixed pre-trained and explainable feature extractor and a trainable software classifier. The hardware part was realized on passive crossbar arrays of memristors based on nanocomposite (Co-Fe-B)x(LiNbO3)100-x structures. The constructed 2-kernel CNN was able to classify the binarized Fashion-MNIST dataset with ~ 84% accuracy. The performance of the hybrid CNN is comparable to the other reported memristor-based systems, while the number of trainable parameters for the hybrid CNN is substantially lower. Moreover, the hybrid CNN is robust to the variations in the memristive characteristics: dispersion of 20% leads to only a 3% accuracy decrease. The obtained results pave the way for the efficient and reliable realization of neural networks based on partially unreliable analog elements.Entities:
Keywords: convolutional neural network; memristor; nanocomposite; neuromorphic computing; resistive switching
Year: 2022 PMID: 36234583 PMCID: PMC9565409 DOI: 10.3390/nano12193455
Source DB: PubMed Journal: Nanomaterials (Basel) ISSN: 2079-4991 Impact factor: 5.719
Figure 1(a) Photo of a single silicon wafer with multiple 4 × 4 and 16 × 16 memristor crossbar arrays; (b) dark field TEM image of a single memristor of an array with denoted layers; (c) I–V characteristics of all utilized NC memristors with optimal x ≈ 23 at.% (x is the metal concentration in the LNO NC); (d) retention time of a single memristor from the crossbar array (only a few selected data points are presented for better visibility, dashed lines represent the average values of the resistances).
Figure 2Schematic illustration of (a) the proposed neural network architecture and (b) the hardware convolutional layer implementation on the memristive crossbar array.
Figure 3Software modeling of the convolutional neural network with (a) one and (b) two kernels (filters). The image parameters (standard or binarized image) as well as the kernel weight parameters (trainable or fixed weights) are specified in the graphs. Each curve was obtained 10 times; the mean values with their standard deviations are presented in the graphs.
Figure 4Comparison of the smoothed software modeling results to the hybrid NN ones with (a) one and (b) two kernels (filters). (c) Confusion matrix for the hybrid CNN with two filters, evaluated on the test dataset, with examples of the misclassified images for the classes with the most frequent mistakes. The maximum ideal values for the main diagonal items equal 100%.