Performance Optimization for MLP Accelerators using ILP-Based On-Chip Weight Allocation Strategy
It is generally impossible to store all weights into an MLP accelerator because of limited on-chip SRAM capacity. However, the performance can still be improved if a portion of weights are allocated in faster SRAM. In this paper, we first present an analytical method for performance evaluation under...
Saved in:
| Published in | Proceedings of Technical Program of International Symposium on VLSI Design, Automation and Test pp. 1 - 4 |
|---|---|
| Main Authors | , , , |
| Format | Conference Proceeding |
| Language | English |
| Published |
IEEE
18.04.2022
|
| Subjects | |
| Online Access | Get full text |
| ISSN | 2472-9124 |
| DOI | 10.1109/VLSI-DAT54769.2022.9768095 |
Cover
| Summary: | It is generally impossible to store all weights into an MLP accelerator because of limited on-chip SRAM capacity. However, the performance can still be improved if a portion of weights are allocated in faster SRAM. In this paper, we first present an analytical method for performance evaluation under different weight allocation approaches. We then propose an ILP-based on-chip weight allocation strategy that can maximize the overall performance. Experiment results show that the proposed strategy constantly outperforms several trivial heuristic methods over a large set of various MLP models, MLP accelerator configurations, and on-chip SRAM capacities. |
|---|---|
| ISSN: | 2472-9124 |
| DOI: | 10.1109/VLSI-DAT54769.2022.9768095 |