sec-firmwareai
Differences
This shows you the differences between two versions of the page.
Next revision | Previous revision | ||
sec-firmwareai [2024/01/18 03:03] – created nuicise | sec-firmwareai [2024/01/18 03:05] (current) – [Learning Objectives:] nuicise | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== Section: Firmware and AI ====== | ====== Section: Firmware and AI ====== | ||
Lecturer: Dr. Nhan Tran (Fermilab, US) | Lecturer: Dr. Nhan Tran (Fermilab, US) | ||
+ | |||
In this module, students will learn how to train an ML algorithm for an experimental physics task using Kears and TensorFlow software packages. They will be taught to design their algorithm satisfying the latency and throughput requirements and at the same time comply with the resource constraints. Students will apply quantization-aware training and parameter pruning to compress the model, making it faster and more efficient, while maintaining an acceptable accuracy. Finally, students will use the HLS4ML Python library to deploy the algorithm on a PYNQ-Z2 FPGA development board | In this module, students will learn how to train an ML algorithm for an experimental physics task using Kears and TensorFlow software packages. They will be taught to design their algorithm satisfying the latency and throughput requirements and at the same time comply with the resource constraints. Students will apply quantization-aware training and parameter pruning to compress the model, making it faster and more efficient, while maintaining an acceptable accuracy. Finally, students will use the HLS4ML Python library to deploy the algorithm on a PYNQ-Z2 FPGA development board | ||
- | Learning Objectives: | + | |
+ | ===== Learning Objectives: | ||
By the end of the course, the participants will be able to: | By the end of the course, the participants will be able to: | ||
- | | + | * Train a neural network using Keras and TensorFlow |
- | • Convert a trained neural network into FPGA firmware using HLS4ML | + | |
- | • Optimize a neural network and its resource utilization for deployment onto an FPGA 4. Deploy a neural network onto an FPGA board | + | |
- | • Introduction to Machine Learning on FPGAs | + | |
- | • Basic overview of FPGAs and their underlying structure. Rationale, Motivation, and trade-offs of using FPGAs for Machine Learning. | + | * Basic overview of FPGAs and their underlying structure. Rationale, Motivation, and trade-offs of using FPGAs for Machine Learning. |
- | • Overview of common Neural Network acceleration techniques and hardware, including GPU Acceleration, | + | * Overview of common Neural Network acceleration techniques and hardware, including GPU Acceleration, |
- | • Considerations and parameters to tune when implementing a neural network on a FPGA. Including parallelism and the trade-off between latency and resource utilization, | + | * Considerations and parameters to tune when implementing a neural network on a FPGA. Including parallelism and the trade-off between latency and resource utilization, |
- | • Using HLS4ML to convert a Neural network into FPGA Firmware | + | * Using HLS4ML to convert a Neural network into FPGA Firmware |
- | • Introduction to using the HLS4ML package, basic configuration, | + | * Introduction to using the HLS4ML package, basic configuration, |
- | • Tuning the details of the implemented model, such as parallelism and precision, performing Post-Training Quantization, | + | * Tuning the details of the implemented model, such as parallelism and precision, performing Post-Training Quantization, |
- | • Advanced configuration of implementation parallelism, | + | * Advanced configuration of implementation parallelism, |
- | • Simulation, profiling and evaluation of a model before firmware generation. | + | * Simulation, profiling and evaluation of a model before firmware generation. |
- | • Optimizing your neural network for deployment onto an FPGA | + | * Optimizing your neural network for deployment onto an FPGA |
- | • Overview of common model compression techniques, including Quantization Aware Training (QAT), Parameter Pruning, and Knowledge Distillation. | + | * Overview of common model compression techniques, including Quantization Aware Training (QAT), Parameter Pruning, and Knowledge Distillation. |
- | • A survey of commonly used Quantizaton Aware Training tool kits, their differences, | + | * A survey of commonly used Quantizaton Aware Training tool kits, their differences, |
- | • Example and walkthrough of model pruning, and how to configure and convert a pruned model with hls4ml. Also an example and discussion of how to combine quantization and pruning, its effects on a model, and an example of converting a quantized and pruned model with hls4ml. | + | * Example and walkthrough of model pruning, and how to configure and convert a pruned model with hls4ml. Also an example and discussion of how to combine quantization and pruning, its effects on a model, and an example of converting a quantized and pruned model with hls4ml. |
- | • Deployment and the PYNQ software stack | + | * Deployment and the PYNQ software stack |
- | • Overview of Xilinx’s “PYNQ” Python API and OS Image, basic usage of PYNQ to interact with, manage, and configure supported devices, such as Xilinx’s “ZYNQ/ | + | * Overview of Xilinx’s “PYNQ” Python API and OS Image, basic usage of PYNQ to interact with, manage, and configure supported devices, such as Xilinx’s “ZYNQ/ |
- | • A discussion and overview of developing/ | + | * A discussion and overview of developing/ |
- | • Deployment of a hls4ml generated firmware image onto a TUL Pynq-Z2 development board, running neural network inferences on the FPGA accelerator via the “PYNQ” API and OS, and an example of running the same project on an “ALVEO” device. | + | * Deployment of a hls4ml generated firmware image onto a TUL Pynq-Z2 development board, running neural network inferences on the FPGA accelerator via the “PYNQ” API and OS, and an example of running the same project on an “ALVEO” device. |
+ | |||
+ | ===== Prerequisites: | ||
- | Prerequisites: | ||
Required for this course: Intermediate experience with the Python programming language, basic understanding of Machine Learning/ | Required for this course: Intermediate experience with the Python programming language, basic understanding of Machine Learning/ | ||
sec-firmwareai.1705572213.txt.gz · Last modified: 2024/01/18 03:03 by nuicise