TruLook: A Framework for Configurable GPU Approximation
In this paper, we propose TruLook, a framework that employs approximate computing techniques for GPU acceleration through computation reuse as well as approximate arithmetic operations to eliminate redundant and unnecessary exact computations. To enable computational reuse, GPU is enhanced with smal...
Saved in:
| Published in | Proceedings - Design, Automation, and Test in Europe Conference and Exhibition pp. 487 - 490 |
|---|---|
| Main Authors | , , , , |
| Format | Conference Proceeding |
| Language | English |
| Published |
EDAA
01.02.2021
|
| Subjects | |
| Online Access | Get full text |
| ISSN | 1558-1101 |
| DOI | 10.23919/DATE51398.2021.9474239 |
Cover
| Summary: | In this paper, we propose TruLook, a framework that employs approximate computing techniques for GPU acceleration through computation reuse as well as approximate arithmetic operations to eliminate redundant and unnecessary exact computations. To enable computational reuse, GPU is enhanced with small lookup tables that are placed close to the stream cores that return already computed values for exact and potential inexact matches. Inexact matching is subject to a threshold controlled by the number of mantissa bits involved in the search. Approximate arithmetic is provided by a configurable approximate multiplier that dynamically detects and approximates operations which are not significantly affected by approximation. TruLook guarantees the accuracy bound required for an application by configuring the hardware at runtime. We have evaluated TruLook efficiency on a wide range of multimedia and deep learning applications. Our evaluation shows that with 0% and less than 1% quality loss budget, TruLook yields on average 2.1× and 5.6× energy-delay product improvement over four popular networks on the ImageNet dataset. |
|---|---|
| ISSN: | 1558-1101 |
| DOI: | 10.23919/DATE51398.2021.9474239 |