TruLook: A Framework for Configurable GPU Approximation

In this paper, we propose TruLook, a framework that employs approximate computing techniques for GPU acceleration through computation reuse as well as approximate arithmetic operations to eliminate redundant and unnecessary exact computations. To enable computational reuse, GPU is enhanced with smal...

Full description

Saved in:
Bibliographic Details
Published inProceedings - Design, Automation, and Test in Europe Conference and Exhibition pp. 487 - 490
Main Authors Garcia, Ricardo, Asgarinejad, Fatemeh, Khaleghi, Behnam, Rosing, Tajana, Imani, Mohsen
Format Conference Proceeding
LanguageEnglish
Published EDAA 01.02.2021
Subjects
Online AccessGet full text
ISSN1558-1101
DOI10.23919/DATE51398.2021.9474239

Cover

More Information
Summary:In this paper, we propose TruLook, a framework that employs approximate computing techniques for GPU acceleration through computation reuse as well as approximate arithmetic operations to eliminate redundant and unnecessary exact computations. To enable computational reuse, GPU is enhanced with small lookup tables that are placed close to the stream cores that return already computed values for exact and potential inexact matches. Inexact matching is subject to a threshold controlled by the number of mantissa bits involved in the search. Approximate arithmetic is provided by a configurable approximate multiplier that dynamically detects and approximates operations which are not significantly affected by approximation. TruLook guarantees the accuracy bound required for an application by configuring the hardware at runtime. We have evaluated TruLook efficiency on a wide range of multimedia and deep learning applications. Our evaluation shows that with 0% and less than 1% quality loss budget, TruLook yields on average 2.1× and 5.6× energy-delay product improvement over four popular networks on the ImageNet dataset.
ISSN:1558-1101
DOI:10.23919/DATE51398.2021.9474239