Convolutional neural networks on dataflow engines

Nils Voss, Marco Bacis, Oskar Mencer, Georgi Gaydadjiev, Wayne Luk

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

5 Citations (Scopus)


In this paper we discuss a high performance implementation for Convolutional Neural Networks (CNNs) inference on the latest generation of Dataflow Engines (DFEs). We discuss the architectural choices made during the design phase taking into account the DFE chip properties. We then perform design space exploration, considering the memory bandwidth and resources utilisation constraints derived from the used DFE and the chosen architecture. Finally, we discuss the high performance implementation and compare the obtained performance against other implementations, showing that our proposed design reaches 2,450 GOPS when running VGG16 as a test case.

Original languageEnglish
Title of host publicationProceedings - 35th IEEE International Conference on Computer Design, ICCD 2017
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Number of pages4
ISBN (Electronic)9781538622544
Publication statusPublished - 22 Nov 2017
Externally publishedYes
Event35th IEEE International Conference on Computer Design, ICCD 2017 - Boston, United States
Duration: 5 Nov 20178 Nov 2017


Conference35th IEEE International Conference on Computer Design, ICCD 2017
CountryUnited States


  • CNN
  • Deep Learning
  • DFE
  • DSE
  • FPGA
  • Inference


Dive into the research topics of 'Convolutional neural networks on dataflow engines'. Together they form a unique fingerprint.

Cite this