Abstract

Model compression techniques have led to a reduction in the size and number of computations of Deep Learning models. However, techniques such as pruning mostly lack real co-optimization with hardware platforms. For instance, implementing unstructured pruning in dedicated hardware is not straightforward: the irregular sparsity pattern increases memory overhead and reduces effective bandwidth usage. Moreover, such pruning algorithms should be adapted to hardware requirements, such as the use of tiling. Therefore, in this work, we leverage Gumbel-Softmax relaxation sampling to structurally prune tiles, which benefits subsequent hardware implementations and additionally allows joint optimization with quantization. We further show that combining different pruning scenarios leads to higher sparsity. Finally, we demonstrate the benefit of structured pruning of fine-grained elements (weights) in an FPGA design.
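To make the core idea concrete, below is a minimal sketch (not the authors' code) of tile-level structured pruning with the Gumbel-Softmax relaxation: each weight tile gets a learnable keep/prune logit pair, a hard binary sample is drawn with the straight-through estimator, and the sampled mask gates the whole tile so the pruning decision stays differentiable during training. The tile size, layer shapes, and class name are illustrative assumptions.

import torch
import torch.nn as nn
import torch.nn.functional as F

class TilePrunedLinear(nn.Module):
    """Linear layer whose weight matrix is pruned in whole tiles
    via a hard Gumbel-Softmax sample (hypothetical sketch)."""

    def __init__(self, in_features, out_features, tile=8):
        super().__init__()
        assert in_features % tile == 0 and out_features % tile == 0
        self.tile = tile
        self.weight = nn.Parameter(torch.randn(out_features, in_features) * 0.02)
        # One (keep, prune) logit pair per tile of the weight matrix.
        self.mask_logits = nn.Parameter(
            torch.zeros(out_features // tile, in_features // tile, 2)
        )

    def forward(self, x, tau=1.0):
        # Hard Gumbel-Softmax sample with straight-through gradients;
        # index 0 of the last dimension is interpreted as "keep the tile".
        sample = F.gumbel_softmax(self.mask_logits, tau=tau, hard=True)
        keep = sample[..., 0]  # shape (rows/tile, cols/tile), entries in {0, 1}
        # Upsample the tile mask to full weight resolution and gate the weights.
        mask = keep.repeat_interleave(self.tile, 0).repeat_interleave(self.tile, 1)
        return F.linear(x, self.weight * mask)

layer = TilePrunedLinear(64, 64, tile=8)
y = layer(torch.randn(4, 64), tau=0.5)  # differentiable w.r.t. mask_logits

In practice one would also anneal the temperature tau and add a sparsity-encouraging term on the keep probabilities (e.g., the mean of softmax(mask_logits)[..., 0]) to the training loss; because the pruned unit is an entire tile, the surviving structure maps directly onto tiled hardware datapaths without per-element index storage.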
