Neuromorphic processor-oriented hybrid Q-format multiplication with adaptive quantization for tiny YOLO3

Bibliographic Details
Published in: Neural Computing & Applications, Vol. 35, No. 15, pp. 11013–11041
Main Authors: Li, Tao; Ma, Yitao; Endoh, Tetsuo
Format: Journal Article
Language: English
Published: London: Springer London (Springer Nature B.V.), 01.05.2023
ISSN: 0941-0643; 1433-3058
DOI: 10.1007/s00521-023-08280-y

Summary: Deep neural networks (DNNs) have delivered unprecedented achievements in the modern Internet of Everything society, encompassing autonomous driving, expert diagnosis, unmanned supermarkets, etc. It remains challenging for researchers and engineers to develop a high-performance neuromorphic processor for deployment in edge devices or embedded hardware. DNNs' power derives from their enormous and complex network architectures, which are computation-intensive, time-consuming, and energy-heavy. Because human perceptual capacity is limited, the full-precision results that DNNs spend substantial computing time producing are redundant in some applications. Utilizing adaptive quantization technology to compress a DNN model while preserving sufficient accuracy is therefore crucial for deploying neuromorphic processors in emerging edge applications. This study proposes a method to advance neuromorphic processor development by performing fixed-point multiplication in a hybrid Q-format, using an adaptive quantization technique on the convolutions of tiny YOLO3. In particular, this work integrates sign-bit checking and bit-roundoff techniques into the fixed-point multiplication arithmetic to address overflow and roundoff issues in the convolution's adding and multiplying operations. In addition, a hybrid Q-format multiplication module is developed to assess the proposed method from a hardware perspective. The experimental results show that hybrid multiplication with adaptive quantization of tiny YOLO3's weights and feature maps yields a lower error rate than alternative fixed-point representation formats while sustaining the same object detection accuracy. Moreover, fixed-point numbers represented in Q(6.9) have a suboptimal yet competitive error rate, making them a viable alternative representation for the tiny YOLO3 algorithm-based neuromorphic processor design. Finally, the 8-bit hybrid Q-format multiplication module exhibits lower power consumption and latency than benchmark multipliers.
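The abstract's core technique, Q-format fixed-point multiplication with a sign-bit overflow check and bit roundoff, can be illustrated with a minimal Python sketch. This is not the paper's hardware design: the function names, the 16-bit word width with Q(6.9) read as 1 sign + 6 integer + 9 fractional bits, the round-half-up policy, and saturation on overflow are all assumptions chosen for illustration.

```python
def to_q(x, n_frac=9):
    """Quantize a float to a Q-format integer with n_frac fractional bits."""
    return int(round(x * (1 << n_frac)))


def from_q(q, n_frac=9):
    """Recover the float value of a Q-format integer."""
    return q / (1 << n_frac)


def q_multiply(a, b, n_frac=9, total_bits=16):
    """Multiply two Q-format integers, returning a Q-format integer.

    Illustrative sketch: the raw product carries 2 * n_frac fractional
    bits, so it is rounded (bit roundoff) back to n_frac bits and then
    saturated (sign-bit/overflow check) to the signed total_bits range.
    """
    product = a * b                                       # 2*n_frac fractional bits
    product = (product + (1 << (n_frac - 1))) >> n_frac   # round half up to n_frac bits
    max_val = (1 << (total_bits - 1)) - 1                 # largest signed value
    min_val = -(1 << (total_bits - 1))                    # smallest signed value
    return max(min_val, min(max_val, product))            # saturate instead of wrapping
```

For example, 1.5 × 2.25 in Q(6.9): `to_q(1.5)` is 768 and `to_q(2.25)` is 1152, and `q_multiply` returns 1728, i.e. `from_q(1728) == 3.375`; a product exceeding the 16-bit range saturates to the boundary value rather than wrapping around.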