Enabling Low-Latency Machine learning on Radiation-Hard FPGAs with hls4ml
AI 摘要
论文展示了在抗辐射FPGA上实现低延迟机器学习应用,并扩展hls4ml工具以支持此类FPGA。
主要贡献
- 开发轻量级自编码器压缩时间读数
- 引入硬件感知的量化策略,降低模型权重
- 扩展hls4ml支持抗辐射FPGA
方法论
通过自编码器压缩数据,结合量化策略降低模型复杂度,并使用hls4ml工具生成FPGA可执行代码。
原文摘要
This paper presents the first demonstration of a viable, ultra-fast, radiation-hard machine learning (ML) application on FPGAs, which could be used in future high-energy physics experiments. We present a three-fold contribution, with the PicoCal calorimeter, planned for the LHCb Upgrade II experiment, used as a test case. First, we develop a lightweight autoencoder to compress a 32-sample timing readout, representative of that of the PicoCal, into a two-dimensional latent space. Second, we introduce a systematic, hardware-aware quantization strategy and show that the model can be reduced to 10-bit weights with minimal performance loss. Third, as a barrier to the adoption of on-detector ML is the lack of support for radiation-hard FPGAs in the High-Energy Physics community's standard ML synthesis tool, hls4ml, we develop a new backend for this library. This new back-end enables the automatic translation of ML models into High-Level Synthesis (HLS) projects for the Microchip PolarFire family of FPGAs, one of the few commercially available and radiation hard FPGAs. We present the synthesis of the autoencoder on a target PolarFire FPGA, which indicates that a latency of 25 ns can be achieved. We show that the resources utilized are low enough that the model can be placed within the inherently protected logic of the FPGA. Our extension to hls4ml is a significant contribution, paving the way for broader adoption of ML on FPGAs in high-radiation environments.