Accelerating AI Inference on Hardware (FPGAs)

Overview

This repository provides resources for using Vitis AI, Xilinx's AI acceleration framework, and the Deep Learning Processor Unit (DPU) to run neural networks efficiently on FPGAs. The workflow demonstrates deploying a pre-trained TensorFlow 2 ResNet-50 model: inspecting the float model, quantizing and compiling it for the DPU, and running accelerated inference on the board.

Key Workflow Stages:

  • Quantization: Convert the floating-point model to an FPGA-compatible format, reducing size and improving performance.
  • Compilation: Use the Vitis AI Compiler to optimize the model for the DPU architecture.
  • Inference: Deploy the optimized model on FPGA hardware to achieve significant improvements in speed and energy efficiency.

Repository Contents

The repository includes the following files and resources:

  1. inspect_float_model.py

    • This script inspects the floating-point model to extract details such as:
      • Layers
      • Input shapes
      • Subgraphs
    • Why Subgraphs Are Important:
      Subgraphs represent portions of the neural network that can be offloaded to the FPGA's DPU for hardware acceleration. The rest of the network executes on the CPU. This division helps maximize efficiency and performance.
  2. quantize_model.py

    • This script handles the quantization process, converting the floating-point ResNet-50 model into an 8-bit integer representation compatible with FPGA DPUs.
    • Input Images for Quantization:
      The quantization process requires a dataset of representative input images. These images can be downloaded from standard datasets such as ImageNet.
  3. analyse_subgraphs.sh

    • This shell script analyzes the compiled model and retrieves detailed subgraph information.
    • It helps verify which parts of the model are mapped to the DPU and provides insights for debugging or optimization (see the Python sketch after the compile step below).
  4. inference_on_board.py

    • This script runs inference on the FPGA hardware using the compiled model (.xmodel).
    • Ensure paths are correctly configured for the model and dataset before running.
  5. Model Files and Configuration

    • Pre-trained ResNet-50 model (float and quantized versions).
    • Compiled Xmodel for deployment.
  6. Support Files

    • Commands and configurations necessary to run Vitis AI workflows.
    • Example scripts for setting up the FPGA board and executing tasks.

Instructions

1. Environment Setup

  • Install Vitis AI 3.5 and Docker (v19.03+).

2. Launch Vitis AI Docker Container

  • Use the following command to launch the TensorFlow 2 CPU container:
    ./docker_run.sh xilinx/vitis-ai-tensorflow2-cpu:latest

3. Inspect the Float Model

python3 inspect_float_model.py
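
In Vitis AI 3.x, such an inspection script typically uses the vitis_inspect API from the TF2 quantization tool. A minimal sketch, not the repository's exact code; the target fingerprint string, model source, and output file names are assumptions:

```python
# Minimal sketch of a float-model inspection with the Vitis AI TF2
# inspector; target fingerprint and output file names are assumptions.
import tensorflow as tf
from tensorflow_model_optimization.quantization.keras import vitis_inspect

# Load the pre-trained float ResNet-50 (the repository may load its own
# .h5 file instead).
model = tf.keras.applications.ResNet50(weights="imagenet")

# Target the KV260's DPUCZDX8G; the report shows which layers map to DPU
# subgraphs and which fall back to the CPU.
inspector = vitis_inspect.VitisInspector(target="DPUCZDX8G_ISA1_B4096")
inspector.inspect_model(model,
                        plot=True,
                        plot_file="model.svg",
                        dump_results=True,
                        dump_results_file="inspect_results.txt",
                        verbose=0)
```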

4. Quantize the Model

python3 quantize_model.py
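
Internally, the script is expected to follow the standard Vitis AI TF2 quantizer flow: load the float model, calibrate on a few hundred representative images, and save the quantized .h5. A minimal sketch; paths, image count, and preprocessing are assumptions:

```python
# Minimal sketch of post-training quantization with the Vitis AI TF2
# quantizer; paths, image count, and preprocessing are assumptions.
import glob
import numpy as np
import tensorflow as tf
from tensorflow_model_optimization.quantization.keras import vitis_quantize

float_model = tf.keras.models.load_model("float/resnet50.h5")

# Build a small calibration set from representative images (e.g. a few
# hundred ImageNet samples), preprocessed exactly as at inference time.
images = []
for path in sorted(glob.glob("calib_images/*.jpg"))[:200]:
    img = tf.keras.utils.load_img(path, target_size=(224, 224))
    images.append(tf.keras.applications.resnet50.preprocess_input(
        np.asarray(img, dtype=np.float32)))
calib_dataset = np.stack(images)

quantizer = vitis_quantize.VitisQuantizer(float_model)
quantized_model = quantizer.quantize_model(calib_dataset=calib_dataset)
quantized_model.save("quantized/quantized.h5")
```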

5. Compile the Model for the FPGA (here the model is compiled for the Kria KV260, DPU DPUCZDX8G)

 vai_c_tensorflow2 -m quantized/quantized.h5 -a /opt/vitis_ai/compiler/arch/DPUCZDX8G/KV260/arch.json -o ./compiled -n compiled_model
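
After compilation, the check performed by analyse_subgraphs.sh can also be done in Python with the xir bindings shipped in the Vitis AI runtime. A sketch under that assumption:

```python
# Sketch: list the subgraphs of the compiled xmodel and the device each
# one targets, assuming the xir Python bindings from the Vitis AI runtime.
import xir

graph = xir.Graph.deserialize("compiled/compiled_model.xmodel")

for sg in graph.get_root_subgraph().toposort_child_subgraph():
    device = sg.get_attr("device") if sg.has_attr("device") else "unknown"
    print(f"subgraph: {sg.get_name():<40} device: {device}")
```

If most of the network lands in a single DPU subgraph, the compilation is healthy; many small CPU subgraphs usually indicate unsupported layers.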

FPGA Setup and Inference Deployment Guide

6. Set Up the FPGA Board

  1. Flash the Vitis AI SD Card Image

    • Download the pre-built Vitis AI SD card image for the Kria KV260 from the official AMD/Xilinx website.
  2. Boot the Board

    • Insert the flashed SD card into the Kria KV260 board.
    • Power on the board.
  3. Establish Ethernet and Serial Connections

    • Connect an Ethernet cable to the board to establish a network connection.
    • Use a serial terminal (e.g., PuTTY, Tera Term, or Picocom) to connect to the board via the UART interface for debugging.

7. Deploy and Run Inference

  1. Transfer the Compiled Model and Scripts to the FPGA

    • Transfer the compiled model, the inference scripts, and the data (images and labels) to the Kria KV260 via SCP.
      • Example command:
        scp -r ./compiled root@<ip_address_of_fpga>:/home/root/workspace/
        
  2. Run the Inference

    • SSH into the Kria KV260 board and run the inference script:

      ssh root@<ip_address_of_fpga>
      python3 inference_on_board.py
    • The inference result is printed to the terminal. Ensure that the model and script paths are correctly configured.
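
A script like inference_on_board.py typically drives the DPU through the VART runner API available in the KV260 image. A minimal sketch; file names, preprocessing, and the int8 I/O dtype are assumptions:

```python
# Sketch of DPU inference with the VART runner API on the board; file
# names, preprocessing, and the int8 I/O dtype are assumptions.
import numpy as np
import vart
import xir

# Locate the DPU subgraph inside the compiled model.
graph = xir.Graph.deserialize("compiled_model.xmodel")
subgraphs = graph.get_root_subgraph().toposort_child_subgraph()
dpu_sg = [s for s in subgraphs
          if s.has_attr("device") and s.get_attr("device").upper() == "DPU"][0]
runner = vart.Runner.create_runner(dpu_sg, "run")

# Allocate I/O buffers matching the runner's tensor shapes (DPU tensors
# are commonly int8; check the actual dtype for your model).
in_dims = tuple(runner.get_input_tensors()[0].dims)
out_dims = tuple(runner.get_output_tensors()[0].dims)
input_data = np.zeros(in_dims, dtype=np.int8)
output_data = np.zeros(out_dims, dtype=np.int8)

# ... fill input_data with a preprocessed, quantized image here ...

job_id = runner.execute_async([input_data], [output_data])
runner.wait(job_id)
print("top-1 class index:", int(np.argmax(output_data[0])))
```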
