Design of Acceleration Unit of Encoding and Frame Generation for PAICORE2.0

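To address the slow software encoding and frame generation in the brain-inspired terminal system built on Peking University's spiking neural network chip PAICORE2.0, a hardware acceleration method is proposed. By adding a hardware acceleration unit, the software encoding and frame generation process, which runs serially on the processing system (PS) side of the Xilinx ZYNQ, is moved into the data path on the programmable logic (PL) side, where it executes in a pipelined, parallel fashion. The hardware acceleration unit mainly comprises highly parallel convolution units, parameterized spiking neurons, and width-balanced data buffers. Experimental results show that the method eliminates the time overhead of software encoding and frame generation while adding almost no transmission delay to the data path. In a CIFAR-10 image classification example, compared with software encoding and frame generation, the hardware encoding and frame generation module adds only 9.3% more LUTs, 3.7% more BRAMs, 2.6% more FFs, 0.9% more LUTRAMs, ...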

Bibliographic Details
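Published in Acta Scientiarum Naturalium Universitatis Pekinensis (北京大学学报(自然科学版)), Vol. 60, no. 5, pp. 786-798
Main Authors DING Yawei (丁亚伟), CAO Jian (曹健), LI Qibin (李琦彬), FENG Shuo (冯硕), YANG Chentao (杨辰涛), WANG Yuan (王源), ZHANG Xing (张兴)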
Format Journal Article
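Language Chinese
Published School of Software and Microelectronics, Peking University, Beijing 102600; School of Integrated Circuits, Peking University, Beijing 100871; Key Laboratory of Integrated Microsystems Science, Engineering and Applications, Peking University Shenzhen Graduate School, Shenzhen 518055 (2024-09-20)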
ISSN 0479-8023
DOI 10.13209/j.0479-8023.2024.066

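Abstract To address the slow software encoding and frame generation in the brain-inspired terminal system built on Peking University's spiking neural network chip PAICORE2.0, a hardware acceleration method is proposed. By adding a hardware acceleration unit, the software encoding and frame generation process, which runs serially on the processing system (PS) side of the Xilinx ZYNQ, is moved into the data path on the programmable logic (PL) side, where it executes in a pipelined, parallel fashion. The hardware acceleration unit mainly comprises highly parallel convolution units, parameterized spiking neurons, and width-balanced data buffers. Experimental results show that the method eliminates the time overhead of software encoding and frame generation while adding almost no transmission delay to the data path. In a CIFAR-10 image classification example, compared with software encoding and frame generation, the hardware encoding and frame generation module adds only 9.3% more LUTs, 3.7% more BRAMs, 2.6% more FFs, 0.9% more LUTRAMs, and 14.9% more DSPs, with 14.6% higher power consumption, yet achieves an inference speedup of about 8.72 times.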
Abstract_FL An edge computing system was designed around the spiking neural network chip PAICORE2.0 of Peking University, in conjunction with Xilinx ZYNQ. However, the software encoding and frame generation processes on the processing system (PS) side are slow and limit the performance of the system. Therefore, a hardware acceleration method is proposed. The software encoding and frame generation processes, which are executed serially on the PS side, are moved to the data path on the programmable logic (PL) side for pipelined parallel execution. The hardware acceleration unit mainly consists of highly parallel convolution units, parameterizable spiking neurons, width-balanced data buffers, and other modules. The results show that the method removes the time overhead of software encoding and frame generation with almost no increase in data path transmission delay. In the example of CIFAR-10 image classification, compared with software encoding and frame generation, the hardware encoding and frame generation module incurs only a modest increase in resource utilization: 9.3% more look-up tables (LUTs), 3.7% more block RAMs (BRAMs), 2.6% more flip-flops (FFs), 0.9% more LUTRAMs, and 14.9% more digital signal processors (DSPs), as well as a 14.6% increase in power consumption. In return, it achieves an approximately 8.72-fold improvement in inference speed.
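The encoding stage described in the abstract (convolution followed by spiking neurons) can be illustrated with a small software sketch. This is not the authors' design: the abstract does not state the neuron model, so an integrate-and-fire neuron with subtractive reset is assumed here, and the names `conv2d_valid` and `if_encode`, the kernel, the threshold, and the number of timesteps are all hypothetical. The sketch also runs serially and does not model the pipelined PL data path or the frame format expected by PAICORE2.0.

```python
from typing import List

Matrix = List[List[float]]


def conv2d_valid(image: Matrix, kernel: Matrix) -> Matrix:
    """Plain 2D cross-correlation with no padding and stride 1."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[r + i][c + j] * kernel[i][j]
                for i in range(kh) for j in range(kw))
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


def if_encode(currents: Matrix, threshold: float,
              timesteps: int) -> List[List[List[int]]]:
    """Assumed integrate-and-fire encoding: one binary spike map per timestep.

    Every neuron adds its input current to a membrane potential each step,
    emits a spike when the potential reaches the threshold, and then
    subtracts the threshold (subtractive reset).
    """
    potential = [[0.0 for _ in row] for row in currents]
    spike_train = []
    for _ in range(timesteps):
        frame = []
        for r, row in enumerate(currents):
            frame_row = []
            for c, current in enumerate(row):
                potential[r][c] += current
                if potential[r][c] >= threshold:
                    potential[r][c] -= threshold
                    frame_row.append(1)
                else:
                    frame_row.append(0)
            frame.append(frame_row)
        spike_train.append(frame)
    return spike_train


# Hypothetical 3x3 input and 2x2 averaging kernel, chosen only for illustration.
image = [[0.0, 0.5, 1.0],
         [0.5, 1.0, 0.5],
         [1.0, 0.5, 0.0]]
kernel = [[0.25, 0.25],
          [0.25, 0.25]]

currents = conv2d_valid(image, kernel)
spikes = if_encode(currents, threshold=1.0, timesteps=4)
```

In this toy setting, a larger convolution output produces more spikes over the same number of timesteps, which is the rate-coding intuition behind this kind of encoder. In hardware, the per-pixel and per-neuron loops are the parts that a design like the one described would be expected to unroll and pipeline.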
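AuthorAffiliation School of Software and Microelectronics, Peking University, Beijing 102600; School of Integrated Circuits, Peking University, Beijing 100871; Key Laboratory of Integrated Microsystems Science, Engineering and Applications, Peking University Shenzhen Graduate School, Shenzhen 518055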
Author_FL_xml – sequence: 1
  fullname: DING Yawei
– sequence: 2
  fullname: CAO Jian
– sequence: 3
  fullname: LI Qibin
– sequence: 4
  fullname: FENG Shuo
– sequence: 5
  fullname: YANG Chentao
– sequence: 6
  fullname: WANG Yuan
– sequence: 7
  fullname: ZHANG Xing
Author_xml – sequence: 1
  fullname: 丁亚伟
– sequence: 2
  fullname: 曹健
– sequence: 3
  fullname: 李琦彬
– sequence: 4
  fullname: 冯硕
– sequence: 5
  fullname: 杨辰涛
– sequence: 6
  fullname: 王源
– sequence: 7
  fullname: 张兴
ContentType Journal Article
Copyright Copyright © Wanfang Data Co. Ltd. All Rights Reserved.
DatabaseName Wanfang Data Journals - Hong Kong
WANFANG Data Centre
Wanfang Data Journals
China Online Journals (COJ)
Discipline Sciences (General)
DocumentTitle_FL Design of Acceleration Unit of Encoding and Frame Generation for PAICORE2.0
EndPage 798
ExternalDocumentID bjdxxb202405002
GrantInformation_xml – fundername: Shenzhen Science and Technology Innovation Commission Fund
  funderid: (KQTD20200820113105004)
IsPeerReviewed false
IsScholarly true
Issue 5
Keywords PAICORE2.0
ZYNQ
spike neural network chip
spike encoding
convolutional acceleration unit
hardware acceleration
Language Chinese
PageCount 13
PublicationDate 2024-09-20
PublicationTitle 北京大学学报(自然科学版)
PublicationTitle_FL Acta Scientiarum Naturalium Universitatis Pekinensis
PublicationYear 2024
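Publisher School of Software and Microelectronics, Peking University, Beijing 102600; School of Integrated Circuits, Peking University, Beijing 100871; Key Laboratory of Integrated Microsystems Science, Engineering and Applications, Peking University Shenzhen Graduate School, Shenzhen 518055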
StartPage 786
Title Design of Acceleration Unit of Encoding and Frame Generation for PAICORE2.0
URI https://d.wanfangdata.com.cn/periodical/bjdxxb202405002
Volume 60