Error Resilience Analysis for Systematically Employing Approximate Computing in Convolutional Neural Networks

Muhammad Abdullah Hanif1,a , Rehan Hafiz2 and Muhammad Shafique1,b
1Institute of Computer Engineering, Vienna University of Technology, Vienna, Austria
amuhammad.hanif@tuwien.ac.at
bmuhammad.shafique@tuwien.ac.at
2Department of Electrical Engineering, Information Technology University, Lahore, Pakistan
rehan.hafiz@itu.edu.pk

ABSTRACT


Approximate computing is an emerging paradigm for error resilient applications, as it leverages accuracy loss for improving power, energy, area, and/or performance of an application. The spectrum of error resilient applications includes the domains of image and video processing, Artificial Intelligence (AI) and Machine Learning (ML), data analytics, and other Recognition, Mining, and Synthesis (RMS) applications. In this work, we address one of the most challenging questions, i.e., how to systematically employ approximate computing in Convolutional Neural Networks (CNNs), which are among the most compute-intensive and pivotal parts of AI. Towards this, we propose a methodology to systematically analyze the error resilience of deep CNNs and identify parameters that can be exploited for improving the performance/efficiency of these networks for inference purposes. We also present a case study for significance-driven classification of filters in different convolutional layers, and propose to prune those having the least significance, thereby enabling accuracy vs. efficiency tradeoffs by exploiting their resilience characteristics in a systematic way.
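The abstract does not specify the significance metric used for classifying filters; as an illustrative sketch only, the following assumes a common proxy, the per-filter L1 norm, and zeroes out the filters with the smallest scores. The function name and shapes are hypothetical, not taken from the paper.

```python
import numpy as np

def prune_least_significant_filters(weights, num_prune):
    """Rank convolutional filters by L1 norm (one common significance
    proxy) and zero out the num_prune least significant ones.

    weights: array of shape (num_filters, channels, kh, kw)
    Returns the pruned weight tensor and the pruned filter indices.
    """
    # L1 norm of each filter serves as its significance score
    scores = np.abs(weights).reshape(weights.shape[0], -1).sum(axis=1)
    # Indices of the num_prune filters with the smallest scores
    prune_idx = np.argsort(scores)[:num_prune]
    pruned = weights.copy()
    pruned[prune_idx] = 0.0  # remove the least significant filters
    return pruned, prune_idx

# Example: 8 random 3x3x3 filters, prune the 2 least significant
rng = np.random.default_rng(0)
w = rng.normal(size=(8, 3, 3, 3))
pruned_w, pruned_idx = prune_least_significant_filters(w, 2)
```

In practice, the acceptable number of filters to prune per layer would be chosen from the resilience analysis, trading accuracy against efficiency.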


