Systolic Array Based Accelerator and Algorithm Mapping for Deep Learning Algorithms

As the depth of DNN increases, the need for DNN calculations for the storage and computing power of the underlying computing platform is increasing. In this work, we implement an accelerator on FPGA for deep learning algorithms (CNN and RNN). The core computing module of the accelerator is a 32 * 32...

Full description

Saved in:

Bibliographic Details
Published in	Lecture notes in computer science Vol. 11276; pp. 153 - 158
Main Authors	Yang, Zhijie, Wang, Lei, Ding, Dong, Zhang, Xiangyu, Deng, Yu, Li, Shiming, Dou, Qiang
Format	Book Chapter
Language	English
Published	Switzerland Springer International Publishing AG 01.01.2018 Springer International Publishing
Series	Lecture Notes in Computer Science
Subjects	Accelerator Data mapping DNN Systolic array
Online Access	Get full text
ISBN	9783030056766 3030056767
ISSN	0302-9743 1611-3349 1611-3349
DOI	10.1007/978-3-030-05677-3_16

Cover

More Information
Summary:	As the depth of DNN increases, the need for DNN calculations for the storage and computing power of the underlying computing platform is increasing. In this work, we implement an accelerator on FPGA for deep learning algorithms (CNN and RNN). The core computing module of the accelerator is a 32 * 32 systolic array of PEs. A mapping method for variable size of CNN and RNN algorithms is proposed. The experiment result shows that the maximum power consumption of the accelerator is 7.5W@100Mhz, the peak performance is 0.2Tops/s, and the real performance is 7.6Mops@100Mhz when running the 1st layer of LeNet-5.
ISBN:	9783030056766 3030056767
ISSN:	0302-9743 1611-3349 1611-3349
DOI:	10.1007/978-3-030-05677-3_16