Anu P Bhushan
June 23, 2021

Hardware Acceleration of Deep Neural Network Models on FPGA (Part 2 of 2)

While Part 1 of this 2-part blog series covered Deep Neural Networks and the different accelerators for implementing Deep Neural Network Models, Part 2 will talk about different Deep Learning Frameworks and hardware frameworks provided by FPGA Vendors.

Deep Learning Frameworks:

Deep learning framework can be considered as a tool or library that helps us to build DNN models quickly and easily without any in-depth knowledge of the underlying algorithms. It provides a condensed way for defining the models using pre-built and optimized components. Some of the important deep learning frameworks are Caffe, TensorFlow, Pytorch, Keras, etc.

Caffe is a deep neural network framework designed to improve speed and modularity. It is developed by Berkeley AI Research. Caffe mainly focuses on image processing applications involving convolutional neural networks (CNNs), but it also provides support for Region-based CNN, RNN, Long Short-term Memory and fully connected neural networks designs. It also supports CPU and GPU acceleration libraries such as NVIDIA cuDNN and Intel MKL. It provides support for C, C++, Python and MATLAB.

TensorFlow is a completely open-source deep learning framework which has pre-written code for deep learning models like RCNN and CNN. It was developed by researchers from Google. It has support for R, C++ and Python languages. It has a flexible architecture that allows deploying models across different platforms like CPU and GPU. TensorFlow works well on sequence-based data as well as on images. The latest version of TensorFlow is TensorFlow 2.0 which has significant improvements in performance on GPU.

Keras is an open-source framework that can run on top of TensorFlow. It is a high-level API which helps in fast experimentation of neural network models. Keras supports both CNN and RNN. It was developed by Francois Chollet, a Google engineer. Keras is written in python and it works perfectly on CPU as well as GPU.

PyTorch is an open-source machine learning library. It is developed by Facebook’s AI research lab and used for applications like computer vision, natural language processing etc. It has Python as well as C++ interface.

Hardware Frameworks for DNN:

FPGA as a hardware accelerator for Deep Neural Networks has its own advantages and disadvantages. One of the main challenges is that FPGA is programmed by describing functionalities using Hardware Description Language (HDL) like VHDL or Verilog. This is different from regular programming like C or C++. To reduce the complexity, tools exist like High-Level Synthesis (HLS) which synthesize high-level languages to HDL codes. Even though implementing neural network models defined in Caffe or TensorFlow frameworks are still complex as designers require in-depth knowledge in both machine learning frameworks as well as FPGA hardware, there are different hardware frameworks developed by FPGA vendors and other third-party companies to significantly reduce such complexity.

Some of the hardware frameworks that we cover here are OpenCL, Intel’s OpenVino, Xilinx DNNDK, Xilinx Vitis AI and Lattice sensAI stack.

Open Computing Language (OpenCL) is a heterogeneous framework for writing and executing programs on different computing platforms, including CPUs, GPUs, FPGAs, Digital Signal Processors (DSPs) and other hardware accelerators. It was launched in 2009 by Apple to utilise the acceleration possibilities of on-board GPU. The newest version is 3.0, which incorporated more C++ features to the language.

The OpenCL framework officially supports C and C++, but unofficial support is available for Python, Java, Perl and INET. An OpenCL implementation of a program is based around a host containing different computing devices, such as a CPU and a GPU, which is further divided into multiple processing elements. A function which is executed using OpenCL is called a kernel and can run in parallel on all processing elements. A programmer can utilise the acceleration capabilities available on a system by getting the device information from the computer the program is running on.

While OpenCL provides good possibilities for acceleration and resource usage, it is limited by its low-level nature. While it has functions for standard operations like FFT, neural networks have to be manually declared unless the frameworks used to generate the network have OpenCL branches. Caffe has such a branch, but it is currently under development. TensorFlow has an OpenCL-branch on its roadmap. The lack of neural network framework support limits its adoption. A more supported and similar framework to OpenCL is Nvidia’s CUDA, although this only runs on Nvidia GPUs.

OpenVINO toolkit is provided by Intel for running neural networks on FPGAs and aims to simplify the process compared to existing solutions. The OpenVINO toolkit was launched in 2018 and it allows users to program applications where neural networks can be accelerated on Intel processors, GPUs, FPGAs and Vision Processing Units (VPUs). The toolkit is compatible with different inference targets and varies between platforms.

OpenVINO is mainly used for accelerating image recognition CNNs but can be used for other purposes such as speech recognition. It supports frameworks such as Caffe and TensorFlow and deep learning architectures such as AlexNET and GoogleNET. It supports a set number of layers for each framework out of the box, with custom layer support available for developers.

In OpenVINO toolkit, the neural network models are optimised using Models Optimizer by taking the models files provided by the neural network framework, such as a caffemodel (from Caffe), with the calculated weights. The default model’s precision is single-precision floating-point, while quantisation to half-precision floating-point is available in the Optimizer. 8-bit integer quantisation is also available.

The Optimizer provides an optimised intermediate representation which is loaded into the code using the Inference Engine API. The API prepares and infers the network to the target device and runs the network with the supplied input data. All pre-processing and post-processing is done in C++, so the only part which has to be replaced is the inference or prediction process.

On an FPGA, OpenVINO uses a pre-loaded bitstream programmed onto the FPGA to accelerate instructions. It does not utilise HLS, but uses the FPGA as a specialised processor for performing mathematical operations found in neural networks, such as convolutions and activations. The OpenVINO bitstreams are fixed for an FPGA and do not allow customizations like adding other IO functions.

To compete with OpenVINO, Xilinx acquired Chinese developer DeePhi in 2018 and their neural network FPGA acceleration SDK Kit (DNNDK). The DNNDK SDK performs model pruning, quantisation and deployment on Xilinx FPGA development kits such as the Xilinx ZCU102, ZCU104 and Avnet Ultra96, along with some of DeePhi’s development kits.

Along with FPGAs, the systems have embedded MCUs, on the Xilinx devices called Multi-Processor System-on-Chip (MPSoC), with FPGA as Programmable Logic and MCU as Processor System (PS). DeePhi claims that the SDK is capable of accelerating CNNs as well as RNNs, achieving a speedup of 1.8x and 19x when compared to Application Specific Integrated Circuit (ASIC) and HLS-implementations of the same network, using 56x less power than the HLS implementation.

DNNDK tool kit utilizes a soft-core processor, the Deep-learning Processor Unit (DPU) to accelerate high computational tasks of DNN algorithms. The DPU is designed to support and accelerate common neural network designs, such as VGG, ResNet, GoogLeNet, YOLO, AlexNET, SSD and SqueezeNet, as well as custom networks. In contrast to OpenVINO, the FPGA image does not occupy the whole FPGA, leaving space for custom HDL code to run alongside the SDK. DNNDK is not available as a separate tool from September 2020. There will not be any new releases further. Xilinx has introduced a new version of a tool called Vitis AI for the deployment of DNN models.

Vitis AI is Xilinx’s latest development platform for DNN inference on Xilinx hardware such as edge devices and Alveo cards. It has tools, well-optimized IPs, models, libraries and example designs. It has the same development flow as DNNDK. It is developed with ease of use and efficiency in mind. Vitis AI also uses Deep Learning Processing Unit (DPU) for AI acceleration. DPU can be scaled to fit different Xilinx hardware Zynq®-7000 devices, Zynq UltraScale+ MPSoCs, and Alveo boards from edge to cloud to meet the requirements of many diverse applications.

Lattice sensAI is a full-featured stack that helps to evaluate, develop and deploy machine learning models in Lattice FPGAs provided by Lattice Semiconductor. It supports popular frameworks like Caffe, TensorFlow and Keras. They have IP cores specially designed to accelerate CNN models. They provide easy to implement, highly flexible, small and low power machine learning solutions.

FPGA Families Targeted for AI Acceleration:

FPGA vendors have optimized their FPGA families to specifically target AI Acceleration.

Intel® Stratix® 10 NX FPGA is Intel’s first AI-optimized FPGA. It embeds a new type of AI-optimized block, the AI Tensor Block, tuned for common matrix-matrix or vector-matrix multiplications.
Intel® Agilex™ FPGAs and SoCs deliver up to 40 percent higher performance or up to 40 percent lower power for applications in the data centre, networking, and edge compute.
Xilinx SoCs are an optimal solution for AI applications. They integrate a processor for software programmability and FPGA for hardware programmability providing scalability, flexibility and performance. They include cost-effective Zynq 7000 SoC and high end Zynq Ultrascale+ MPSoC, Zynq Ultrascale+ RFSoC.
Lattice Semiconductor provides FPGAs for machine learning applications which are easy to implement, low power and highly flexible. Their hardware platforms include iCE40 UltraPlus FPGA, ECP5 FPGA and CrossLink-NX.
Microchip has PolarFire SoC that is suitable for reliable, secure and power-efficient computations in Artificial Intelligence/Machine Learning (AI/ML), industrial automation, imaging and Internet of Things (IoT) etc

Summary:

FPGAs are now widely used in data centres for offloading GPU-based and CPU-based inference engines. These are early days in the definition, expansion and deployment of such capabilities starting from targeted FPGAs, model development and optimization frameworks and ecosystem of supported libraries. A rapid acceleration of capabilities of FPGAs is envisaged over the next five years to tackle a plethora of applications that could be deployed in the real world.

Read Part 1 here…

53 thoughts on “Hardware Acceleration of Deep Neural Network Models on FPGA (Part 2 of 2)”

Ernestbem
June 12, 2025 at 8:36 pm

I recently tried https://killakush.com/products/focus-gummies , and I’m extraordinarily impressed with the quality. The effects were smooth, calming, and exactly what I was hoping for. The miscellany of options also allowed me to find something flawless for both relaxing evenings and fecund days. Indubitably advise after anyone seeking great results!
Kod Binance
June 12, 2025 at 9:35 pm

Your point of view caught my eye and was very interesting. Thanks. I have a question for you.
RobertzeF
June 13, 2025 at 12:44 am

I’ve been exploring terpene-based products [url=https://terpenewarehouse.com/collections/diamante ]high terpene strains[/url] recently, and I’m remarkably enjoying the experience. The scents are rich, real, and pleasant. They add a nice touch to my daily drill, helping congeal the atmosphere and atmosphere. A brobdingnagian hit upon after anyone who appreciates savoury wellness tools.
Josephagito
June 13, 2025 at 11:29 pm

I’ve been exploring terpene-based products https://terpenewarehouse.com/ recently, and I’m indeed enjoying the experience. The scents are rich, real, and pleasant. They enlarge a outgoing be a match for to my day after day routine, plateful congeal the mood and atmosphere. A brobdingnagian hit upon quest of anyone who appreciates pungent wellness tools.
sativa tincture
June 25, 2025 at 4:38 pm

I’ve been using sativa tincture ordinary on account of during the course of a month for the time being, and I’m indeed impressed by the positive effects. They’ve helped me perceive calmer, more balanced, and less solicitous from the beginning to the end of the day. My snore is deeper, I wake up refreshed, and even my nave has improved. The value is distinguished, and I appreciate the natural ingredients. I’ll obviously carry on buying and recommending them to the whole world I recall!
Josephonepe
June 25, 2025 at 4:54 pm

vardenafil 10mg pill [url=https://levinevino.com]levitra buy uk[/url] generic levitra price
vardenafil teva generics vardenafil levitra 10mg levitra online australia
Hip Hop
June 26, 2025 at 3:53 pm

This is precisely the kind of intelligent and stimulating discussion that is desperately needed in the online sphere. It encourages genuine critical thinking rather than just passive consumption. Thank you for consistently raising the bar.
RichardPat
June 27, 2025 at 3:47 am

I’ve been exploring terpene-based products https://terpenewarehouse.com/collections/indica-terpenes recently, and I’m deep down enjoying the experience. The scents are well off, customary, and pleasant. They enlarge a outgoing drink to my always routine, helping set the mood and atmosphere. A great catch sight of for anyone who appreciates savoury wellness tools.
Hip Hop Music
June 30, 2025 at 1:56 am

Good post! We will be linking to this particularly great post on our site. Keep up the great writing
Düzce evden eve nakliyat
July 6, 2025 at 3:30 pm

This was beautiful Admin. Thank you for your reflections.
seo hizmeti
July 13, 2025 at 8:47 pm

I like the efforts you have put in this, regards for all the great content.
Edgardex
July 15, 2025 at 4:29 pm

alprazolam sans ordonnance: crГЁme emla ordonnance – tramadol sans ordonnance
Williamtreve
July 16, 2025 at 4:31 pm

https://tryggmed.com/# neglelim apotek
KennethTeeks
July 16, 2025 at 7:26 pm

apotek d vitamin: Trygg Med – promillemГҐler apotek
Michaeleveme
July 16, 2025 at 7:35 pm

https://snabbapoteket.com/# utslag bilder
Altonneody
July 16, 2025 at 8:19 pm

apotek snorking: Trygg Med – skjerm til hund apotek
Michaeleveme
July 17, 2025 at 1:07 am

https://tryggmed.com/# hudlim apotek
KennethTeeks
July 17, 2025 at 1:10 am

apotheke online: pseudoephedrine kopen in nederland – online apotheek – gratis verzending
ScottAverm
July 17, 2025 at 2:08 am

medicijnen apotheek [url=http://zorgpakket.com/#]online apotheek 24[/url] pil online bestellen
Altonneody
July 17, 2025 at 2:26 am

rГёyksyre apotek: mandelolje apotek – apotek koronatest
Williamtreve
July 17, 2025 at 3:09 am

https://tryggmed.shop/# legevakten apotek
Michaeleveme
July 17, 2025 at 6:36 am

https://zorgpakket.com/# ï»¿medicijnen bestellen
KennethTeeks
July 17, 2025 at 6:51 am

forstoppelse hund apotek: vann i Гёret apotek – nГҐl til ГҐ ta hull i Гёret apotek
ScottAverm
July 17, 2025 at 8:50 am

frenadol kopen in nederland [url=https://zorgpakket.com/#]Medicijn Punt[/url] digitale apotheek
Altonneody
July 17, 2025 at 8:52 am

internetapotheek nederland: apotheek winkel 24 review – aptoheek
Michaeleveme
July 17, 2025 at 1:23 pm

https://tryggmed.com/# apotek munnbind
KennethTeeks
July 17, 2025 at 2:14 pm

medicatie aanvragen: п»їmedicijnen bestellen – medicijn
Altonneody
July 17, 2025 at 4:16 pm

peppermynteolje kapsler apotek: bestille apotekvarer pГҐ nett – finasterid apotek
ScottAverm
July 17, 2025 at 4:37 pm

hur stavas te [url=http://snabbapoteket.com/#]SnabbApoteket[/url] hemma apotek
Williamtreve
July 17, 2025 at 4:53 pm

https://zorgpakket.com/# mijn apotheek online
Michaeleveme
July 17, 2025 at 8:11 pm

https://zorgpakket.shop/# online apotheek zonder recept ervaringen
KennethTeeks
July 17, 2025 at 9:46 pm

pseudoephedrine kopen in nederland: Medicijn Punt – medicijne
Altonneody
July 17, 2025 at 11:35 pm

apotek pcr test: Snabb Apoteket – stГ¶dbГ¤lte
ScottAverm
July 18, 2025 at 12:28 am

medicijnen kopen zonder recept [url=https://zorgpakket.com/#]MedicijnPunt[/url] internetapotheek nederland
Michaeleveme
July 18, 2025 at 3:27 am

https://snabbapoteket.shop/# sluta Ã¤ta godis app
KennethTeeks
July 18, 2025 at 5:47 am

fast grГёnnsГҐpe apotek: krem apotek – rubbing alcohol apotek
Altonneody
July 18, 2025 at 7:17 am

gurkemeie tabletter apotek: Trygg Med – klikk og hent apotek
Williamtreve
July 18, 2025 at 8:37 am

https://tryggmed.com/# abortpille apotek
ScottAverm
July 18, 2025 at 8:37 am

apteka nl online [url=https://zorgpakket.com/#]MedicijnPunt[/url] digitale apotheek
Michaeleveme
July 18, 2025 at 10:28 am

https://snabbapoteket.com/# lika delar apotek
KennethTeeks
July 18, 2025 at 1:16 pm

apotek Г¶ppet dygnet runt: SnabbApoteket – kokosolja mot fГ¤stingar
LewisBex
July 18, 2025 at 3:06 pm

do pharmacy sell viagra: clonidine online pharmacy – loratadine online pharmacy
Davidphara
July 18, 2025 at 5:00 pm

reputable indian pharmacies [url=https://indiamedshub.com/#]india online pharmacy[/url] IndiaMedsHub
Robertfluot
July 18, 2025 at 5:03 pm

https://indiamedshub.com/# indian pharmacies safe
Bobbynew
July 18, 2025 at 7:41 pm

mail order pharmacy india: top 10 online pharmacy in india – reputable indian online pharmacy
Vernonagind
July 18, 2025 at 9:32 pm

http://medimexicorx.com/# mexican drugstore online
Robertfluot
July 18, 2025 at 10:25 pm

https://expresscarerx.org/# online pharmacy reviews propecia
Davidphara
July 18, 2025 at 11:04 pm

prescription drugs mexico pharmacy [url=http://medimexicorx.com/#]best prices on finasteride in mexico[/url] tadalafil mexico pharmacy
Bobbynew
July 19, 2025 at 1:07 am

gabapentin mexican pharmacy: buy kamagra oral jelly mexico – buy meds from mexican pharmacy
LewisBex
July 19, 2025 at 2:37 am

lamotrigine online pharmacy: amitriptyline pharmacy – Starlix
Robertfluot
July 19, 2025 at 3:35 am

https://indiamedshub.shop/# best india pharmacy

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

info@ignitarium.com

Hardware Acceleration of Deep Neural Network Models on FPGA (Part 2 of 2)

Deep Learning Frameworks:

Hardware Frameworks for DNN:

FPGA Families Targeted for AI Acceleration:

Summary:

53 thoughts on “Hardware Acceleration of Deep Neural Network Models on FPGA (Part 2 of 2)”

Leave a Comment

Stay informed

NEWS & VIEWS

Join our team

APPLY

PRIVACY POLICY

©2025 Ignitarium Technology Solutions, All Rights Reserved

Newsletter

An ISO 9001:2015 certified company

Great Place to Work® Certified

We are a leading provider of Product Engineering Services, offering expertise in Semiconductor design, Multimedia & Imaging, Connectivity, Cloud & Enterprise solutions, and Machine Learning & Deep Neural Networks

Semiconductor

Software

Ecosystem

Resources

Contact Us

Request for Video

info@ignitarium.com

Hardware Acceleration of Deep Neural Network Models on FPGA (Part 2 of 2)

Deep Learning Frameworks:

Hardware Frameworks for DNN:

FPGA Families Targeted for AI Acceleration:

Summary:

53 thoughts on “Hardware Acceleration of Deep Neural Network Models on FPGA (Part 2 of 2)”

Leave a Comment

Stay informed

NEWS & VIEWS

Join our team

APPLY

PRIVACY POLICY

©2025 Ignitarium Technology Solutions, All Rights Reserved

Newsletter

An ISO 9001:2015 certified company

Great Place to Work® Certified

We are a leading provider of Product Engineering Services, offering expertise in Semiconductor design, Multimedia & Imaging, Connectivity, Cloud & Enterprise solutions, and Machine Learning & Deep Neural Networks

Semiconductor

Software

Ecosystem

Resources

Contact Us

Human Pose Detection & Classification

Features:

Target Markets:

OCR / Pattern Recognition

Use cases :

Highlights :

Behavior Monitoring

Use cases :

Highlights :

Attire & PPE Detection

Use cases :

Use cases :

Request for Video

Real Time Color Detection​

Use cases :

Highlights :

Missing Artifact Detection

Use cases :

Highlights :

Real Time Manufacturing Line Inspection

Use cases :

Highlights :

Ground Based Infrastructure analytics

Use cases :

Highlights :

Aerial Analytics

Use cases :

Highlights :

SANJAY JAYAKUMAR

Request Free Demo

RAMESH EMANI

​Manoj Thandassery

MALAVIKA GARIMELLA​

PRADEEP KUMAR LAKSHMANAN

SONA MATHEW

ASHWIN RAMACHANDRAN

AZIF SALY

RAJU KUNNATH

PRADEEP SUKUMARAN

SUJEET SREENIVASAN

RAJIN RAVIMONY

SIBY ABRAHAM

SUDIP NANDY

SUJEETH JOSEPH

SUJITH MATHEW IYPE

RAMESH SHANMUGHAM

Real Time Color Detection

Manoj Thandassery

MALAVIKA GARIMELLA