Literature DB >> 34086580

Custom Hardware Architectures for Deep Learning on Portable Devices: A Review.

Kh Shahriya Zaman, Mamun Bin Ibne Reaz, Sawal Hamid Md Ali, Ahmad Ashrif A Bakar, Muhammad Enamul Hoque Chowdhury.   

Abstract

The staggering innovations and emergence of numerous deep learning (DL) applications have forced researchers to reconsider hardware architecture to accommodate fast and efficient application-specific computations. Applications, such as object detection, image recognition, speech translation, as well as music synthesis and image generation, can be performed with high accuracy at the expense of substantial computational resources using DL. Furthermore, the desire to adopt Industry 4.0 and smart technologies within the Internet of Things infrastructure has initiated several studies to enable on-chip DL capabilities for resource-constrained devices. Specialized DL processors reduce dependence on cloud servers, improve privacy, lessen latency, and mitigate bandwidth congestion. As we reach the limits of shrinking transistors, researchers are exploring various application-specific hardware architectures to meet the performance and efficiency requirements for DL tasks. Over the past few years, several software optimizations and hardware innovations have been proposed to efficiently perform these computations. In this article, we review several DL accelerators, as well as technologies with emerging devices, to highlight their architectural features in application-specific integrated circuit (IC) and field-programmable gate array (FPGA) platforms. Finally, the design considerations for DL hardware in portable applications have been discussed, along with some deductions about the future trends and potential research directions to innovate DL accelerator architectures further. By compiling this review, we expect to help aspiring researchers widen their knowledge in custom hardware architectures for DL.

Entities:  

Year:  2021        PMID: 34086580     DOI: 10.1109/TNNLS.2021.3082304

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw Learn Syst        ISSN: 2162-237X            Impact factor:   10.451


  2 in total

1.  Motion-Based Object Location on a Smart Image Sensor Using On-Pixel Memory.

Authors:  Wladimir Valenzuela; Antonio Saavedra; Payman Zarkesh-Ha; Miguel Figueroa
Journal:  Sensors (Basel)       Date:  2022-08-30       Impact factor: 3.847

2.  Emotion Recognition on Edge Devices: Training and Deployment.

Authors:  Vlad Pandelea; Edoardo Ragusa; Tommaso Apicella; Paolo Gastaldo; Erik Cambria
Journal:  Sensors (Basel)       Date:  2021-06-30       Impact factor: 3.576

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.